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ABSTRACT 


Architectures  of  computer  systems  based  on  Data  Flow  (DF)  concepts  attracted 
great  attention  as  an  alternative  to  conventional  sequential  architectures  (Von  Neumann). 
DF  architectures  are  capable  of  efficiently  exploiting  a  massive  amount  of  parallelism 
inherent  in  many  types  of  computation.  They  are  programmed  using  directed  graphs 
whose  vertices  are  function  modules  and  whose  edges  denote  data  dependencies  between 
function  modules.  An  important  subclass  of  DF  is  Large  Grain  Data  Row  (LGDF)  which 
is  efficiently  used  in  computation  intensive  applications,  such  as  signal  processing.  Pres¬ 
ently,  most  leadoffs  incorporate  nondeterministic  run-time  technique  to  allocate  system 
resources  to  support  the  execution  (One  such  technique  could  be  First  Come  First  Served). 
Despite  of  the  usual  simplistic  nature  of  scheduling  techniques  which,  results  in  a  low  run¬ 
time  overhead,  the  system  throughput  and  predictability  could  rapidly  degrade  under  high 
system  load.  To  provide  uniform  output  and  improve  the  resource  usage  even  under  a  high 
load,  a  compile-time  technique  called  Revolving  Cylinder  (RC)  was  introduced.  In  this 
thesis,  we  present  a  LGDF  simulator  and  a  Graph  restructurer  that  restructures  the  given 
graph  according  to  the  RC  technique.  We  then  perform  a  comparative  experimental  study 
of  the  different  implementation  of  RC  and  the  FCFS  scheduling  techniques.  Our  results 
demonstrate  that  there  is  a  high  potential  for  the  RC  technique,  if  a  satisfactory  node  map¬ 
ping  technique  is  developed. 


IV 


TABLE  OF  CONTENTS 


I.  INTRODUCTION . 1 

A.  DATA  FLOW  ARCHITECTURES  . 1 

B.  OBJECTIVES . 2 

1.  Scope  of  the  Thesis  . 2 

C.  THESIS  ORGANIZATION  . 3 

II.  BACKGROUND . 4 

A.  REVOLVING  CYLINDER  ANALYSIS . 4 

B.  IMPLEMENTATION  OF  RC . 10 

C.  POTENTIALS  OF  THE  RC  ANALYSIS  . 1 1 

IB.  SIMULATOR . 12 

A.  PROGRAM  MODEL  . 12 

B.  MACHINE  MODEL  . 15 

1.  Processors . 15 

a.  Generic  Processors  (GPs) .  15 

b.  Input/Output  Processors  (IOPs) . 1 6 

2.  Global  Memory  Modules(GMMs) . 16 

3.  Scheduler . 17 

C.  SIMULATOR  DESCRIPTION .  18 

D.  SIMULATOR  PROGRAM  DATA  STRUCTURE . 22 

E.  USER  INTERFACE . 22 

1.  Input . 22 

2.  Output . 24 

F.  EXAMPLE  RUNS  OF  THE  SIMULATOR  . 25 

IV.  GRAPH  RESTRUCTURING  . 28 

A.  CYLINDER  MAPPING . 30 

B.  INDEX  ASSIGNMENT . 3 1 

C.  SYNCHRONIZATION  ARCS  CREATION . 32 

D.  PERFORMANCE  EVALUATION . 35 

1.  Basic  Performance  Measurements . 38 

2.  Evaluating  the  Effect  of  Memory  Mapping . 40 

3.  Effect  of  the  Queue  Size  on  the  System  Throughput . 42 


r 


V.  CONCLUSION  REMARKS . 43 

A.  CONCLUSIONS  . 43 

B.  FUTURE  WORK  . 43 

APPENDIX  A:  Sample  Graph  File . 45 

APPENDIX  B:  Simulator  Source  Code . 48 

APPENDIX  C:  Graph  Restructure  Source  Code . 135 

APPENDIX  D:  Interrelation  of  the  Files  in  PIPDAFS  and  GR . 162 

REFERENCES . 163 

INITIAL  DISTRIBUTION  LIST . 164 


vi 


I.  INTRODUCTION 


A.  DATA  FLOW  ARCHITECTURES 

The  Data  Flow  (DF)  architecture  is  an  alternative  to  Von  Neumann  architecture  and  is 
capable  of  exploiting  a  massive  amount  of  parallelism  inherent  in  many  types  of 
computation.  In  the  DF  model  of  computation,  a  program  is  represented  by  a  directed  graph 
in  which  the  nodes  denote  operations  and  the  arcs  connecting  them  represent  data 
dependencies.  Nodes  become  ready  when  their  operands  arrive,  thus  computations  are 
data-driven.  The  DF  model  supports  concurrent  execution  of  ready  nodes.  In  this  model, 
execution  of  nodes  is  asynchronous  because  of  its  data-driven  nature.  Also,  computations 
are  free  from  side  effects.  There  is  no  notion  of  shared  data  storage  and  results  are  conveyed 
directly  by  means  of  data  arcs,  [KARP  66].  These  properties  imply  that  multiprocessor 
architectures  based  on  DF  model  need  not  suffer  from  the  synchronization  and  coordination 
overheads  incurred  in  Von  Neumann  architectures. 

Depending  upon  the  chosen  granularity,  DF  architectures  can  be  categorized  in  two 
groups:  Large  Grain  Data  Flow  (LGDF)  and  Fine  Grained  Data  Row  (FGDF) 
architectures.  The  granularity  of  nodes  is  crucial  to  the  effectiveness  of  multiprocessing. 
For  a  given  application,  the  larger  the  node  grain  size,  the  smaller  the  degree  of  parallelism 
that  can  be  exploited.  However,  the  larger  the  grain  size,  the  smaller  the  amount  of 
communication  overhead  that  is  incurred.  Thus,  fine  granularity  does  not  necessarily  imply 
better  performance,  because  it  also  increases  communication  overhead,  which  affects  the 
parallelism  exploited.  The  choice  of  granularity  can  be  influenced  by  software  engineering 
considerations  as  well  as  by  parallel  processing  considerations,  [LEE  89]. 

Real-time  compute-intensive  applications  require  predictable  response  time  and  high 
performance  as  measured  by  throughput.  Satisfying  response  time  and  throughput 
requirements  are  critical  for  the  correct  functioning  real-time  applications.  The 
predictability  of  both  response  time  and  throughput  can  be  influenced  by  resource 
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allocation  and  communication  overhead.  Resource  allocation  and  communication 
overhead  can  be  controlled  at  compile-time  and  at  run-time  to  lead  to  high  throughput  and 
deterministic  response  time. 

Based  on  how  a  graph  node  and  arc  attributes  are  used  at  compiie-time  and  how  much 
control  information  is  generated  to  aid  the  run-time  mechanism,  DF  scheduling 
implementations  can  be  classified  as  fully  dynamic,  self  timed,  static,  fully  static.  Fully 
dynamic  allocation  performs  all  scheduling  of  nodes  at  run-time  based  on  the  readiness  of 
inputs  and  resource  availability.  In  a  self  timed  allocation  system,  compiler  determines  the 
order  of  the  node  execution  and  allocates  resources,  but  execution  of  the  nodes  is 
determined  at  run-time  by  data  arrival.  Static  node  allocation  involves  the  assignment  of  a 
node  to  a  processor,  but  the  order  of  execution  is  left  up  to  run-time  scheduler  based  on  the 
node’s  input  data.  In  fully  static  allocation,  the  compiler  determines  the  exact  execution 
time  assignment,  and  ordering  of  nodes  based  on  that  node’s  predicted  behaviour,  [LEE 
90], 

B.  OBJECTIVES 

First-Come-First-Served  (FCFS)  is  a  scheduling  strategy  where  the  ready  node 
primitives  are  ordered  by  the  time  they  become  ready.  High  system  loads  may  cause 
memory  contention  and  imperfect  computation/communication  overlap  resulting  in  a 
degradation  of  throughput  and  unpredictable  response  time.  The  FCFS  strategy  does  not 
exert  any  run-time  control  to  solve  the  above  mentioned  problem. 

In  this  thesis,  compile-time  analysis  of  LGDF  graphs  is  carried  out  using  the 
Revolving  Cylinder  (RC)  technique.  RC  restructures  the  original  graph  to  obtain  high 
throughput  and  predictable  response  time,  [SLZ  92].  Performance  analysis  of  results 
obtained  by  simulation  is  carried  out  to  compare  merits  of  different  approaches. 

1.  Scope  of  the  Thesis 

In  [LIT  91]  and  [SLZ  92],  the  RC  approach  to  determine  the  node  execution 
sequence  was  first  suggested  to  enhance  the  system  throughput. 
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This  thesis  refines  the  RC  approach  and  its  scope  is  analysis  of  previous  work 
[LIT91]  [BELL  92], development  of  an  event  driven  simulator  (PIPDAFS)  for  LGDF 
machines,  and  finally  comparison  of  techniques  for  graph  restructuring  using  the  RC 
approach. 

C.  THESIS  ORGANIZATION 

Chapter  II  reviews  the  RC  approach  and  the  previous  work  accomplished  in  the  area. 
In  Chapter  III,  we  present  PIPDAFS  (Periodic  Inputs  DAta  Flow  Simulator)  and  sample 
runs. In  Chapter  IV,  we  discuss  and  compare  the  graph  restructuring  techniques  and 
analysis  of  the  experimented  results.  In  Chapter  V,  we  conclude  from  the  results  along  with 
suggestion  for  future  work  to  be  done. 
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II.  BACKGROUND 


A.  REVOLVING  CYLINDER  ANALYSIS 

The  Revolving  Cylinder  (RC)  technique  for  determining  the  node  execution  sequence 
performs  compile-time  analysis  and  results  in  the  restructuring  of  the  graph,  [LIT  91],  [SLZ 
92].  RC  technique  restructures  the  application,  described  by  a  LGDF  graph  by  using  the 
machine  configuration  and  the  application  graph  as  input. 

The  first  step  towards  restructuring  is  to  layout  the  graph  nodes  in  a  schedule  that 
achieves  some  purpose  (e.g.  no  nodes  reading  or  writing  from  the  same  memory  modules 
at  the  same  time).  The  graphs  that  we  deal  with  are  Directed  Acyclic  Graphs  (DAG).  We 
benefit  from  research  done  on  fine  grained  scheduling  of  parallel  loops. 

Loop  scheduling  literature  [RAU  81]  [HSU  86],  has  dealt  with  the  problem  of 
scheduling  multiple  iterations  of  the  same  parallel  loop  (represented  by  a  DAG)  to  saturate 
the  available  resources.  The  resultant  schedules  possess  a  minimal  initial  cost  for  iterations, 
while  possibly  extending  the  completion  time  of  each  iteration. 

Our  scheduling  technique  can  benefit  from  these  results  by  noting  that  the  real-time 
applications  that  we  deal  with  require  multiple  instantiations  of  a  DAG  each  of  whose 
nodes  is  a  large-grained  primitive.  These  multiple  instantiations  have  similarities  with  the 
execution  of  parallel  iterations  of  a  loop.  Based  on  that,  we  drive  the  following  scheduling 

principle1: 

“The  schedule(s)  providing  the  maximal  throughput  with  minimal  initiation  interval 
for  a  DAG  depend  only  on  the  resource  requirements  of  the  DAG  nodes  and  not  on  the 
topology  of  the  DAG.  " 


I.  This  principle  can  be  obviously  deduced  from  the  work  of  [RAU  81]  and  [HSU  86}  although  it  was  not 
explicitly  mentioned  there. 


4 


This  principle  simply  states  that  schedules  with  the  same  throughput  can  be  found  for 
different  DAGs  as  long  as  they  have  similar  sets  of  nodes  with  respect  to  the  resource 
requirements.  As  a  result  of  this  principle,  it  might  be  worthwhile  to  reduce  the  DAG  to  be 
scheduled  to  an  equivalent  DAG  with  the  same  set  of  nodes  \  and  an  empty  set  of  edges  E 
before  scheduling  it  as  will  be  shown  below. 

The  following  examples  should  clarify  the  principle: 


The  DAG  I,  DAG  II  and  DAG  III  are  three  graphs  with  same  node  requirements  and 
different  edges.  Then  according  to  the  above  principle,  they  can  have  the  same  schedule 
which  produces  the  same  throughput.  Two  example  of  the  many  possible  schedules  for 
these  graphs  are  shown  in  Figure  2.2a  and  Figure  2.2b,  (The  schedules  assume  two 
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processors).These  examples  Jepict  the  node  execution  pattern  that  is  induced  by  a 
schedule.  The  nodes  might  belong  to  different  instances.  The  exact  instance  to  which  a  node 
belongs  depends  on  the  set  of  edges  (that  has  been  ignored  so  far)  are  in  tables  1,2,3.  We 
say  nodes  in  the  schedules  with  the  indices  of  the  instances  they  belong  to.  The  above 
principle  reduces  optimal  throughput  of  a  DAG  scheduling  to  a  bin-packing  problem. 
While  the  set  of  edges  in  a  DAG  has  no  effect  on  the  potential  throughput  of  the  DAG,  it 
will  affect  the  instance  response  time  associated  with  any  schedule. 


Figure  2.2.  Two  possible  schedules  for  graphs  with  two  processor 
The  schedules  can  be  enforced  as  it  is  depicted  Table  1,  Table2  and  Table  3 

respectively. 

Table  1 :  Compact  Representation  of  the  schedule  1  on  DAG  1 
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Table  2:  Compact  Representation  of  schedule  II  on  DAG  I 


PI 

P2 

Ci-1 

ei-2 

Ci-1 

ei-2 

fi-3 

ei-2 

fi-3 

ei-2 

di-l 

di-l 

bi 

Table  3:  Compact  Representation  of  schedule  I  on  DAG  III 


PI 

P2 

q 

fi 

q 

fi 

ei 

Ci 

q 

q 

di 

bi 

di 

Consider  the  Tables  1  and  2  both  of  which  show  an  indexed  version  of  schedule  I  and 
II  on  DAG  I.  The  corresponding  schedule  of  one  instance  of  DAG  I  using  the  Tables  1  and 
2  are  shown  in  Figure  2.3a  and  Figure  2.3b  respectively.  It  is  clear  that  these  are  different 
schedules  possessing  the  property  that  they  yield  the  same  throughput. 

The  algorithms  for  assigning  indices  to  the  nodes  in  a  schedule  and  to  synchronize  the 
nodes  so  as  to  be  faithful  to  the  schedule  are  the  main  purpose  of  “RC  Scheduling”.  These 
algorithms  can  be  found  in,  [SLZ  92]. 

The  name  “Revolving  Cylinder”  reflects  a  way  looking  at  the  schedule  by  wrapping 
the  nodes  around  a  cylinder,  thereby  causing  its  end  to  meet  its  beginning.  For  each  node 
in  the  original  graph,  with  the  top  and  working  toward  the  bottom,  attempt  to  schedule  the 
node  at  its  earliest  start  time  If  it  can  not  be  inserted  at  that  time,  delay  the  start  time  by  the 
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width  of  a  slot  and  repeat  until  it  can  be  inserted.  The  earliest  start  time  of  all  descendants 
of  that  node  and  repeat  the  above  sequence  with  the  next  node  as  the  top  node  in  the  graph, 
[LIT  91]  and  [SLZ  92], 
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Figure  2.3.  Execution  cycles  of  the  schedule  I  and  schedule  II  on  DAG  I 
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In  Figure  2.4,  the  execution  of  RC  belonging  to  the  DAG  I  is  shown.  Each  node  of  the 
graph  occupies  a  portion  of  the  cylinder  equal  to  its  execution  time.  And  a  new  instantiation 
could  be  started  every  six  cycles,  when  two  processor  are  used.  Thus  another  instance  of 
the  graph  can  be  overlapped  with  the  first  instance  after  six  cycles.  To  prevent  any  conflict 
on  the  graph  execution  when  instances  are  overlapped,  nodes  are  assigned  indices.  For  the 
example  presented  in  Table  1,  ej  can  not  be  executed  at  the  same  time  as  aj  however,  eg 
can,  [SLZ  92], 


Figure  2.4.  Execution  of  DAG  I  with  Schedule  I 
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B.  IMPLEMENTATION  OF  RC 


Once  all  nodes  have  been  inserted  into  the  cylinder,  then  rode  indices  based  on  their 
location  on  the  cylinder  can  be  determined.  Figure  2.5  depicts  the  node  indices  of  DAG  I 
on  schedule  I. 


PI  P2 


a0 

e  -l 

f-2 

c0 

d-1 

b0 

Figure  2.5.  Node  indeces  of  DAG  I  with  schedule  I. 

Dependency  arcs  are  created  by  using  these:  indices.  The  dependency  arcs  belonging 
to  DAG  I  according  to  the  schedule  I  are  shown  in  Figure  2.6.  Detailed  explanation  of  node 
index  assignment,  and  dependency  arc  creation  is  given  in  Chapter  IV. 
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The  labels  on  each  dependency  arcs  indicates  the  initial  tokens,  threshold  and 
consumed  amount  of  sink  node  respectively. 

C.  POTENTIALS  OF  THE  RC  ANALYSIS 

The  RC  technique  of  restructuring  the  application  provides  an  improvement  that  the 
dependencies  enforce  node  execution  in  order  to  provide  more  throughput  and  predictable 
response  time.  Without  any  extraneous  control  the  FCFS  scheduling  can  provide  uniform 
throughput  The  nodes  receiving  external  data  are  ready  for  execution  independent  of  the 
status  of  other  nodes  in  the  graph.  If  external  data  arrives  more  frequently  than  the 
execution  frequencies  of  the  lower  nodes  of  the  graph,  they  fall  behind  the  upper  nodes  of 
the  graph.  This  results  in  the  upper  nodes  output  queues  going  overcapacity,  preventing 
them  from  entering  the  ready  node  list,  [POPS  90]. 

It  is  possible  to  enforce  the  execution  order  loosely.  Enforcing  the  cylinder  loosely 
would  make  the  system  fully  dynamic  where  the  nodes  are  scheduled  at  run  time  only.  It  is 
preferable  to  run  the  system  in  fully  dynamic  mode.  The  RC  technique  and  subsequent 
restructuring  simply  enhance  the  fully  dynamic  mode. 

It  is  also  possible  to  enforce  the  execution  order  strictly  which  would  make  the  system 
fully  static.  For  each  node  of  the  graph  a  specific  processor  and  exact  time  to  begin 
processing  can  be  given  from  the  cylinder.  This  can  be  enforced  directly  by  the  scheduler 
to  yield  the  fully  static  mode.  But  the  running  the  system  in  the  fully  static  mode  is 
unwarranted.  This  mode  of  operation  can  be  limited  to  the  machines  which  do  not  have  a 
dedicated  run-time  scheduler.  The  failure  of  a  single  processor  will  crash  the  whole  system. 
This  is  unacceptable  in  a  real-time  system.Also  since,  all  processors  are  assumed  to  be 
identical,  it  becomes  unnecessary  to  assign  a  specific  pocessors  to  a  specific  node.  If  a  node 
is  ready  for  execution,  it  can  be  assigned  to  the  first  available  processor.  This  will  reduce 
the  amount  of  time  a  node  waits  for  a  processor  in  the  fully  static  case,  and  provides 
flexibility  and  optimal  utilization  of  system  resources,  [LEE  90]. 
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III.  SIMULATOR 


A.  PROGRAM  MODEL 

The  input  to  Periodic  Input  Processing  DAta  Flow  Simulator  (PIPDAFS)  is  a 
directed  graph.  Nodes  are  the  computation  to  be  performed  on  the  input  data. 

Data  passing  l;nks  between  the  nodes  are  FIFO  queues.  Each  node  reads  data  from  its 
input  queue,  performs  a  nontrivial  computation  and  writes  the  produced  data  to  the  output 
queues.  A  node  is  assured  to  carry  no  history.  An  example  of  a  program  graph  which 
PIPDAFS  will  simulate  is  shown  in  Figure  3.1. 


The  functionalities  of  the  nodes  are  not  simulated  in  PIPDAFS,  only  the  resource 
requirements  and  computation  times  are.  For  each  node  and  queue  following  quantities  are 
prespecified  and  is  fixed  across  different  invocation  of  the  graph. 

Node  Execution  Time  (Ej):  It  is  the  execution  time  of  node  i  excluding  any 
communication  and  synchronization  overhead. 

Data  Production  Amount  (Pj):  It  is  the  amount  of  data  that  node  i  produces  to  its 
output  queues  for  every  invocation. 
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Data  Consumption  Amount  (Qj):  It  is  the  amount  data  that  node  i  consumes  from 


its  input  queues  for  each  reading. 

Data  Threshold  Amount  (Rj  j):  It  is  the  least  amount  of  data  to  be  present  in  queue 
(i,j),  so  that  node  j  can  start  execution. 

Data  Capacity  Amount  (C,*j):  It  is  the  maximum  amount  of  data  queue  (i.j)  can 

store. 

When  a  source  node  finishes  its  execution,  it  starts  to  write  results  to  the  output 
queues,  output  queues  current  lengths  are  incremented  by  production  amount  of  source 
node.  If  current  length  of  the  queue  is  greater  than  threshold  quantity  then  queue  is  said  to 
be  over_threshold.  Queues  can  not  be  overcapacity  since  source  nodes  which  will  cause 
overcapacity  can  not  be  ready.  Input  queue  sizes  are  updated  as  soon  as  a  node  consume  its 
data  from  that  queue.  When  data  is  consumed  the  queue  current  lengths  are  decreased  by 
data  consumption  quantity.  Input  data  for  a  node  are  queued  and  consumed  in  FIFO  order. 
Node  production  amount  can  be  different  from  the  consumption  amount  as  it  is  depicted  in 
Figure  3.1.  Queues  do  not  only  communicate  data,  but  also,  they  can  communicate 
synchronization  information. 


E,:  Execution  time  of  node  i 

P, :  Data  Production  Amount  of  node  i 

Q, :  Data  Consumption  Amount  of  node  i 

R^:  Data  Threshold  Amount  of  queue 

from  node  1  to  node  j. 

C^.Data  Capacity  of  the  queue  from 

node  i  to  node  j. 

Current  Length  of  queue  from 
node  i  to  node  j 


Figure  3.2.  A  sample  graph  node  with  its  associated  queues. 
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Assuming  the  node  N  in  Figure  3.2  whose  input  queues  are  numbered  from  1  to  n 
and  output  queues  are  numbered  from  1  to  m.Let  us  accurately  describe  the  conditions  in 
which  a  node  can  be  ready  to  be  executed: 

Queue  Current  Length  (Ljj):  Current  length  of  the  queue  (i,j). 

Queue  which  links  node  i  to  node  N,  is  overjhreshold  when; 

Ljjsj  >  Ri  jsj  for  alii,  1— >n 

Queue  which  links  node  i  to  node  N,  is  overcapacity  when; 

LjN  >  Cjjsj  for  all  i,  1  — >n 

When  node  setup  is  completed  each  input  queue  of  the  node  is  consumed. 

=  Lj jsj  -  Qn  for  all  i,  1  — >n 

When  a  node  is  executed  each  output  queue  of  the  node  is  written. 

Ljsj  j  =  Ljsf,j  +  Pi  for  all  j,  1— *m 

A  node  is  ready  to  execution  when  following  conditions  are  satisfied. 

i. )  L  ^  >  R ;  n  for  all  i,  1-An  (All  input  queues  are  over_threshold.) 

ii. )  L  nj+P  n  -  C  Nj  f°r  j>  l~»ni  (  All  output  queues  has  enough  space 
to  store  the  results.) 

iii.)  If  tn  is  the  completion  time  of  nth  instance  of  node  i,  then 

tn+l  >  t"  +  Ej  for  all  i,  1— »n  (A  node  can  not  have  multiple  instances 
executing  at  a  time.) 

Input  data  arrival  has  a  periodic  nature.  In  every  period  I/O  processors  execute 
I/O  nodes.  The  I/O  nodes  read  the  raw  data  from  the  external  units  such  as  sensors,  format 
it,  and  forward  it  to  its  output  queues.  If  the  data  arrival  rate  is  higher  than  I/O  node 
execution,  then  there  is  a  chance  of  having  new  data  overwrite  the  old  data  causing  a  data 
loss.  It  is  the  responsibility  of  application  programmer,  to  ensure  the  program  correctness 
by  either  choosing  a  computation  which  is  not  sensitive  to  mild  data  loss  or  by  choosing  an 
appropriate  rate  that  can  be  met  almost  always  by  different  machine  components. 
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B.  MACHINE  MODEL 

The  machine  model  we  assumed  is  based  on  three  functional  components:  the 
Scheduler(SCH),  Processors  (GPs,  IOPs),  Global  Memory  Modules  (GMMs).  As  data  for 
the  nodes  becomes  available,  each  node  must  be  scheduled  to  execute  on  a  processor.  The 
machine  model  can  be  seen  in  Figure  3.3,  and  its  functional  components  are  described 
below  in  details. 


1.  Processors 

a.  Generic  Processors  ( GPs) 

Based  on  the  schedule  signal  a  GP  will  read  the  primitive  code  and  the 
necessary  queue  data  from  the  GMMs  (set-up  stage),  it  will  execute  the  designated 
primitives,  and  write  the  output  queue  data  back  to  the  GMMs.(breakdown  stage).  A 
maximum  of  three  nodes  can  be  associated  with  an  GP  at  any  one  time 

i. )  One  node  being  setup. 

ii. )  One  node  executing. 

iii. )  Another  node  being  broken  down. 
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It  is  important  to  note  that,  if  a  node  is  being  broken  down,  a  node  assigned  to  set 
up  on  the  GP  can  not  start  to  set  up  (although  it  has  been  assigned  to  that  GP). 

Allowing  set  up  and  break  down  to  overlap  with  execution  is  the  main  mechanism 
for  allowing  computation-communication  overlap. 

A  processor  is  said  to  be  free  and  available  for  reassigning  by  the  scheduler,  if  the 
setup  stage  is  not  busy.  Needless  to  say  that  this  concept  of  a  free  processor  is  different  from 
the  classical  definition  of  what  a  free  processor  is.  A  processor  can  still  be  executing  and 
be  considered  free.  The  purpose  of  that  is  to  allow  for  maximum  computation  and 
communication  overlap. 

When  the  data  is  ready  for  an  internal  node,  SCH  sends  a  signal  to  GMM  to  send 
code  to  assigned  GP.  Each  GP  has  a  local  memory  which  capable  of  storing  the  primitive 
code  and  the  input  data.  As  soon  as  GP  acquires  the  code  seeks  input  data  from  GMMs. 
When  a  GP  finishes  node  set  up  informs  the  SCH  that  it  is  free. 

b.  Input! Output  Processors(IOPs) 

Input  nodes  periodically  receive  data  from  the  sensors.  IOPs  execute  the  input 
node  of  DFG  by  formatting  the  raw  data  and  writing  to  their  output  queues.  Output  nodes 
are  also  executed  at  IOPs  and  IOPs  redirect  the  ready  data  to  the  next  stage  in  the  system. 

When  the  data  is  ready  for  input  or  output  nodes,  SCH  sends  a  signal  to  an 
IOP  to  execute  th<*  specified  input  or  output  node.  IOPs  send  a  signal  to  GMMs  for  writing/ 
reading. 

2.  Global  Memory  Modules(GMMs) 

The  GMMs  provide  the  data  storage  for  the  machine.  These  GMMs  are  different 
from  a  typical  computer  memory,  since  each  is  proactive  (each  has  some  sort  of  a 
processor)  and  operates  independently.  Each  data  queue  is  allocated  to  a  single  GMM  for 
storage.  When  a  GP  starts  node  setup,  input  data  queues  are  consumed.  When  a  GP  finishes 
execution  all  the  output  queues  from  the  completed  node  are  produced  (written)  to  the 
appropriate  GMMs.  After  each  produce  and  consume  queue,  current  lengths  are  updated. 
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GMMs  check  for  over_threshold  or  overcapacity  conditions.  If  GMM  recognize  a  over_ 
threshold  informs  the  scheduler.  When  a  node  is  assigned  to  an  GP,  SCH  instructs  to  GMM 
to  write  primitive  code  to  the  GP’s  local  memory.  Upon  receiving  the  primitive  code  by  GP, 
it  asks  appropriate  GMMs  for  input  data.  GMMs  only  communicate  with  one  processor  at 
a  time.  If  there  are  requests  while  GMM  is  busy,  these  requests  are  waited  for  GMM  to  be 
available. 

3.  Scheduler 

In  this  model,  SCH  works  like  a  dispatcher  and  maintains  two  list  one  for  the 
ready  nodes  and  the  other  one  for  processors  which  indicates  the  free  processors.  SCH 
dispatches  the  ready  nodes  to  the  free  processors.  A  node  can  be  in  three  different  state 

i. )  Ready  node;  which  is  waiting  in  the  ready  list  to  be  assigned  to  a  free 
GP  to  be  executed. 

ii. )  Processing  node;  which  is  assigned  to  a  processor  and  is  in  one  of  the 
setup,  breakdown  or  execution  stages. 

iii.)  Waiting  Node;  which  is  waiting  for  input  data’s  being  ready  or  output 
queues  having  enough  space  or  both. 

A  node  becomes  ready,  if  all  of  its  input  queues  are  over  threshold  and  output 
queues  will  not  be  over_  threshold  after  the  node  execution.  When  a  node  becomes  ready, 
it  is  put  into  the  ready  node  list  and  then  SCH  attempts  to  match  a  free  processor  to  a  ready 
node.  When  a  processor  finishes  node  setup  and  start  to  node  execution,  it  is  indicated  as 
free  processor,  and  SCH  checks  the  ready  node  list,  if  ready  node  list  is  not  empty  then 
attempts  to  match  a  ready  node  to  the  free  processor. 

We  assume  that  SCH  runs  on  a  dedicated  processor.1 


1.  Alternatively  Scheduler  could  be  made  to  run  on  a  CP  with  a  high  priority. 
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C.  SIMULATOR  DESCRIPTION 

PIPDAFS  is  an  event  driven  simulator,  events  are  stored  in  a  queue  which  is 
prioritized  bv  their  time  stamps.  The  simulation  terminates  when  a  certain  number  of 
instances  <  prespecified  by  the  user)  has  been  executed.  The  event  queue  is  first  initialized 
with  events  denoting  readiness  of  the  input  nodes,  then  the  rest  of  the  events  are  produced 
from  these  initial  events.  An  instance  is  counted  as  started  when  one  of  the  input  nodes 
belonging  to  that  instance  has  started  to  setup.  An  instance  is  counted  as  completed,  when 
all  of  the  output  nodes  belonging  to  that  instance  have  finished  breakdown.  The  events  that 
may  occur  during  the  simulation  are  shown  in  Figure  3.4.  In  Figure  3.4  simulation  starts 
from  the  reach_producton_event  and  terminates  with  finish_breakdown  event. The  events 
are; 

1)  Reach_production period:  This  event  is  produced  periodically  by  the  external 
input  data  and  if  tf  2  output  queues  of  input  node  has  enough  space,  makes  the  input  nodes 
ready  to  execute.  This  event  produces  the  Inputqueuesoverthreshold  and  the 
Reach  production  period  events. 

2)  Inputqueuesoverthreshold :  This  event  indicates  that  all  input  queues  of  a  node 
are  over_threshold.  In  other  words,  input  data  is  available  for  node  execution.  While 
processing  this  event  output  queues  are  checked  whether  they  enough  space  to  store  the 
results  or  node  is  an  output  node.  If  the  output  queues  have  space  or  the  node  is  an  output 
node  then  the  Ready pode  event  is  produced  to  indicate  that  node  is  ready  to  be  executed. 

3)  Ready pode:  This  event  indicates  that  node  is  ready  to  be  executed  and  can  be 
put  into  the  ready  node  list  and  be  scheduled  to  a  free  processor.  If  a  previous  instance  of 
the  ready  node  is  currently  executing  then  according  to  the  policy  (defined  by  the  user 
before  simulation)  ready  node  may  be  put  into  the  ready  node  list,  but  not  executed  until 
previous  instance  has  been  completed  or  ready  node  not  even  put  into  the  ready  node  list. 
For  each  ready  node  added  the  Schedule  ppode JrompeadyJist  event  is  produced. 
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Figure  3.4.  Events  that  are  produced  during  the  simulation 
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4)  Schedule ja_nodeJrom_readyJist\  If  both  the  free  processor  list  and  the 
ready  node  list  were  not  empty.  This  event  attempts  to  match  a  ready  node  to  a  free 
processor.  If  the  attempt  is  successful,  it  produces  Start_setup  event. 

5)  Start_setup:  This  event  is  produced,  when  a  ready  node  matches  to  free 
processor  successfully.  If  the  node  is  the  first  input  node  of  a  new  instance,  then  it  will 
indicate  the  time  for  a  new  instance  start. 

6)  Start_reading_instruction_stream :  This  event  keeps  trying  to  read  the 
instruction  stream  until  the  memory  module,  where  instructions  are  stored  is  not  busy. 
Then  it  marks  the  memory  module  as  busy  and  produces  the 
Finish _readingJnstruction_stream  event. 

7)  Finish_readingJnstruction_stream:This  event  marks  a  memory  module  as 
not  busy  and  for  each  input  queues  of  the  node  whose  instruction  stream  was  accessed,  the 
Start _readqueue  event  is  produced. 

8)  Start jreadqueue:  This  event  keeps  checking  a  memory  module  and  assigned 
processor  until  memory  module  is  not  busy  and  processor  is  not  reading,  then  the 
Finish jreading  event  is  produced.  Reading  is  simulated  by  time  delay. 

9)  Finish_reading:  This  event  completes  the  reading  of  the  queue.  If  it  is  the  last 
queue  which  is  to  be  read,  then  Finish_setup  event  is  produced. 

10)  Finishjsetup :  If  processor  is  not  executing  and  break  >  stage  is  not  busy, 
setup  can  be  finished.  When  setup  is  finished,  Free  processor  and  v tan  execution  events 
are  produced.  Otherwise  finish_setup  event  is  reentered  with  a  new  time  stamp. 
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11)  Free  processor:  This  event  indicates  that  processor  is  free  and  produces  the 
Schedule  jajnode _Jrom_ready_list  event  is  produced. 

12)  Start jexecution: This  event  produces  Finishjexecution  event. 

13)  Finish_execution\  If  breakdown  and  setup  stages  are  not  busy,  then  produces 
the  Start _breakdown  event  else  checks  whether  setup  stage  is  waiting  for  execution  to 
finish  or  not.  When  setup  stage  is  waiting  for  execution  to  finish,  Finishjexecution  event 
is  produced  in  order  to  prevent  deadlock.  If  neither  setup  stage  is  waiting  for  execution  stage 
nor  setup  and  breakdown  stages  were  not  ready  then  Finishjexecution  event  reentered 
with  a  new  time  stamp. 

14)  start_breakdown:  This  event  produces  Start  jwrite_queue  event  for  each 
output  queue  and  indicate  the  assigned  processor  as  not  free.  Since  setup  and  breakdown 
can  not  happen  concurrently  in  this  model. 

15)  Start jwrite  jqueue:  If  memory  module  to  be  written  is  not  currently  busy 
and  processor  is  not  currently  writing,  then  the  Finish jwriting  event  is  produced  else 
Start jwritejqueue  is  reentered  with  a  new  time  stamp. 

16)  F  inish_writing:  This  event  completes  the  writing  to  the  queue.  If  it  is  the 
last  queue  to  be  written,  produces  the  Finish_breakdown  event.  Updates  the  output 
queue’s  current  length.  If  queue  length  is  over_threshold  produce  the 
Queue joverthreshold  event. 

17)  Finish _breakdown:Th\s  event  completes  the  breakdown.  If  the  node 
completed  is  the  last  output  node  of  that  instance,  it  is  assumed  that  an  instance  is  finished 
and  instance  finish  time  is  written  a  file  namely  endtimes. 
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18)  Queues joverthreshold:  This  event  checks  the  all  input  queues  whether  all  are 
over_threshold  or  not.  If  all  are  over_threshold,  then  the  Inputqueuesoverthreshold  event  is 
produced. 


19)  Free  scheduler.  This  event  marks  the  scheduler  as  not  busy  and  produces 
Schedule _a_node _Jrom_ready_list  event.  Scheduling  time  can  be  defined  by  the  user.  This  is 
useful  in  runs  when  the  overhead  of  scheduling  is  a  problem  research. 


D.  SIMULATOR  PROGRAM  DATA  STRUCTURES 

The  simulator  is  written  in  C++.  Its  data  structure  and  class  dependencies  are  depicted  in 
Figure  3.5.  The  simulator  code  is  given  in  Appendix  B. 

E.  USER  INTERFACE 
1.  Input 

To  execute  the  simulator  user  simply  invokes  the  executable  version  and  follows  the 
on-screen  prompts.  The  simulator  collects  data  about  machine  from  file  machine. config. 
Graph  data  is  stored  in  the  file  graph.dat.  Also  the  memory  assignments  is  given  in  the  file 
memory.dat.  A  sample  graph  data  and  a  machine  configuration  file  is  given  in  Appendix  A. 
Besides  the  graph  and  the  machine  files,  user  is  prompted  for  the  following  data: 

-  Instance  Start  Number.  The  instance  number  when  information  gathering  is 

started. 

-  Instance  End  Number.  The  instance  number  when  the  simulation  will  be 

completed. 

-  Instance  Overlap :  It  the  amount  of  overlap  between  the  multiple  instances  of 
the  same  node.  It  could  be  none  which  means  while  a  node  instance  is  ready,  another  instance 
of  the  same  node  can  not  be  ready.  Or  it  could  be  overlapped  which  means  multiple  instances 
of  a  node  can  be  ready  in  the  ready  node  list,  but  only  one  of  them  executed  at  a  time. 
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Mltst  Class 
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Figure  3.5.  Simulator  Program  Class  dependency  and  data  structure. 


-  Node  Execution  Priority:  It  is  the  priority  for  scheduling  the  nodes  in  the 
ready  node  list.  User  choose  of  the  following: 

Shortest  First  (Execution  time):  Nodes  in  the  ready  node  list  are 
ordered  by  their  execution  times. 

Longest  First  (Length):  Nodes  in  the  ready  node  list  are  ordered  by 

their  length. 

Random:  Node’s  priorities  are  defined  by  the  simulator  randomly. 

User  Defined:  Node’s  priorities  are  defined  by  the  user. 

No  Priority(FCFS) 

-  Data  Period:  User  may  want  to  repeat  the  simulation  for  different  data 
periods  without  changing  the  other  data.  This  time  in  order  to  prevent,  the  user  from 
reinvoking  the  simulator.  The  initial  data  period  and  final  data  period  with  increment 
amount  is  asked  by  the  simulator. 

Start  Period:  It  is  the  data  period  where  the  simulation  will  start. 

End  Period:  It  is  the  data  period  where  the  simulation  will  finish. 

Interval:  It  is  amount  of  increase  in  the  data  rate  for  after  each  simulation. 

2.  Output 

After  the  simulation  is  completed,  following  data  is  produced: 

-  Processor  Utilization  (Considering  setup  and  breakdown  as  useful 
utilization):  It  is  stored  in  file  utilization.dat. 

-  Processor  Utilization  (Only  execution):  It  is  stored  in  file  execution.dat. 

-  Throughput:  It  is  stored  in  file  throughputdat 

-  Response  Time:  It  is  stored  in  file  resptime.dat 

-  Coefficient  of  Variation:  It  is  ratio  of  instance  length  standard  deviation  to 
the  average  instance  length.  It  is  stored  in  file  Coefvar.dat 
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Files  also  contain  corresponding  normalized  data  rate  values. 

If  the  user  prefer  to  store  the  events  occurring  during  simulation,  a  log  file  can  be 
produced.  Log  file  includes  time,  event  name,  node  number,  queue  number,  memory 
module ,  processor  number.  Any  user  who  interested  in  different  kinds  of  statistics  can 
obtain  desired  information  by  filtering  the  log  file. 

F.  EXAMPLE  RUNS  OF  THE  SIMULATOR 

The  graph  depicted  in  Figure  5.1  is  executed  with  simulator.  The  graph  belongs  to  a 
real  application  which  is  called  correlator. 


The  results  belonging  to  correlator  graph  are  shown  in  Figure  3.7,  Figure  3.8,  Figure  3.9. 
In  resultant  graph,  X-coordinate  refers  to  the  normalized  data  rate  which  is  the  ratio  of 
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maximum  throughput  rate  to  the  input  data  rate.  Maximum  throughput  rate  means  total 
execution  time  over  number  of  processors.Y-coordinate  refers  to  throughput,  utilization  or 
coefficient  of  standard  deviation.  Throughput  is  the  number  instances  that  are  executed  in 
unit  time  in  this  simulator  which  one  second.  Utilization  shows  processor  usage  percent, 
obtained  by  dividing  processor  busy  time  to  total  execution  time.  Two  kinds  of  utilization 
is  calculated  for  processors,  one  consider  the  setup  and  breakdown  as  useful  utilization 
and  the  other  one  does  not.  For  each  data  rate  500  instance  were  executed  and  statistical 
result  are  obtained  from  last  300  hundred  instances. 


Figure  3.8.  Correlator  graph,  throughput. 
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IV.  GRAPH  RESTRUCTURING 


This  chapter  describes  the  graph  restructuring  that  enforces  RC  scheduling  in  detail. 
Also,  it  evaluates  the  potential  of  RC-based  scheduling  techniques  through  a  series  of 
experiments. 


Figure  4. 1  An  example  for  graph  restructuring. 


Figure  4. 1  describes  the  steps  that  are  needed  to  restructure  an  application  graph  and 
make  it  ready  for  execution.The  first  step  in  the  process  is  cylinder  mapping.  At  this  point, 

graph  nodes  are  mapped  to  the  cylinder  according  to  a  given  objective.1  The  circumference 


1 .  In  the  Figure  4.1b  the  mapping  is  based  on  fitting  the  graph  by  using  the  nodes  in  a  topologically 
sorted  list. 
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of  the  cylinder  is  determined  by  the  sum  of  the  nodes’  execution  times  divided  by  the 
number  of  processors  (an  approximation  which  does  not  consider  the  communication 
delay).  The  mapping  shown  in  Figure  4.1(b)  assumes  two  processors.  After  the  cylinder  is 
mapped,  the  next  step  is  index  assignment.  In  this  step,  nodes  are  given  relative  indices  that 
clarify  which  node  is  executing  which  relative  instance.  For  example  in  Figure  4.1(c),  while 
node  f  is  executing,  node  e  can  execute  one  later  instance.  Indices  axe  determined  by  the 
nodes’  locations  on  the  cylinder.Figure  4.1(c)  depicts  the  index  assignment.  After  the 
indices  are  assigned,  the  next  step  is  synchronization  arcs  creation  to  enforce  the  given 
schedule  in  Figure  4.1(c).  Figure  4.1(d)  shows  the  synchronization  arcs.  The 
synchronization  arcs  that  are  obtained  are  pruned  by  removing  the  redundant  arcs.  And  the 
resultant  arcs  are  added  to  the  original  graph.  The  restructured  graph  with  the  added 
synchronization  arcs  is  depicted  in  Figure  4.1(d).  The  program  that  restructures  the  graph 
is  called  Graph  Restructurer  (GR)  and  its  organization  is  represented  in  Figure  4.2. The 
source  code  of  GR  is  given  in  Appendix  C. 


.  „  .  Machine  Configuration 

Input^raph  |  ° 

Cylinder  Mapping 

ied  cylinder 

- ^  |Mapp 

Node  Index 
Assignment 

Cylinder  with  indices 

Creating 

Synchronization  arcs 

pendency  arcs 

_ ,  De 

Modify  original 
gTaph 

Restructured  Graph 

Figure  4.2.  The  structure  of  the  graph  restructurer. 
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The  GR  basically  has  three  functions,  cylinder  mapping,  index  assignment  and  the 
synchronization  arcs  creation. 

A.  CYLINDER  MAPPING 

The  cylinder  mapping  component  of  the  GR  takes  the  original  graph  and  machine 
configuration  as  input  data.  In  [LIT  91]  and  [BELL  92],  nodes  are  ordered  for  mapping 
according  to  their  execution  times.  The  graph  topology  is  not  considered.  This  produces 
more  dependency  arcs  increasing  the  communication  overhead.  Figure  4.3  represents  the 
dependency  arcs  for  the  same  graph  with  a  random  mapping.  This  random  mapping 
produces  7  synchronization  arcs,  but  the  mapping  which  considers  the  graph  topology 
produces  only  one  synchronization  arc.  In  Figure  4.4  the  mapping  algorithm  which 
considers  the  graph  topology  is  given. 

In  the  given  algorithm.  First  the  nodes  are  sorted  topologically,  then  mapping  starts 
from  the  root  node  and  continues  down  the  sorted  list  of  the  nodes.  The  mapping  heuristic 
is  to  map  a  child  node  to  an  empty  space,  such  that  it  can  start  after  its  parent’s  finish  time. 
If  no  such  space  can  be  found,  then  the  node  is  placed  in  the  earliest  start  empty  space 
(probably  resulting  in  a  synchronization  arc).  If  no  attempt  to  find  any  empty  space  is 
successful,  then  the  cylinder  circumference  is  incremented  by  a  percentage  specified  by  the 
user,  and  the  mapping  process  is  restarted  all  over  again. 
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procedure  Map_the_cylinder(G,C)  /*G  is  Directed  Acyclic  Graph* 

/*  C  is  the  cylinder*/ 

q  <—  topological_sort(G);  /*  q  is  a  queue*/ 
boolean  Success=  true 

for  each  node  sorted  topologically  in  the  graph 

Try  to  place  the  node  to  an  empty  space  starts  after  latest  end 
parent  node; 

if  there  is  no  empty  space 

try  to  place  the  node  to  the  empty  space  that  starts  before  the 
latest  ends  parent  node,  but  has  enough  space  to  place  the  node 
after  the  latest  parent  node; 
if  there  is  no  empty  space 

try  to  place  the  node  to  the  earliest  available  empty  space, 
else  if  there  is  no  empty  place 
success  =  false; 

increment  the  circumference  of  the  cylinder; 

Map_the_cylinder(G,C); 

exit; 

end  (for); 

end  Map_the_cylinder. 


Figure  4.4.  Algorithm  for  cylinder  mapping. 

B.  INDEX  ASSIGNMENT 

The  data  flow  execution  of  the  graph  requires  the  analysis  of  the  assignment  of  the 
nodes  to  the  cylinder  to  determine  which  instance  of  a  node  is  actually  to  be  executed.  The 
algorithm  for  index  assignment  is  given  in  [BELL  92]. 

In  this  algorithm,  an  arbitrary  node  is  assigned  an  index  of  zero  to  represent  the  fact 
that  it  is  the  current  instance  of  this  node  that  is  being  executed.  Every  parent  and  child  of 
this  node  is  examined  with  respect  to  its  relative  position  on  the  cylinder. 
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For  a  node  N,  Indices  of  the  parents  and  children  nodes  are  determined  according  to 
the  following  conditions: 

1.  If  a  parent’s  completion  time  on  the  cylinder  is  greater  than  the  node  N’s  start 
time  on  the  cylinder,  then  the  parent  must  be  executing  a  later  instance.Therefore,  the 

parent  is  given  an  index  which  is  incremented  by  one. 

2.  If  a  parent’s  completion  time  on  the  cylinder  is  less  than  or  equal  to  the  node 
N’s  start  time  on  the  cylinder,  then  the  parent  can  be  executing  the  same  instance  with  the 
current  node.  Therefore,  parent  is  given  the  same  index  as  node  N. 

3.1f  a  child’s  start  time  on  the  cylinder  is  less  than  the  node  N’s  finish  time  on  the 
cylinder  then  the  child  must  be  executing  a  previous  instance.  Therefore,  the  child  is  given 

an  index  which  is  decremented  by  one. 

4.  If  a  child’s  start  time  on  the  cylinder  is  greater  than  or  equal  to  the  node  N’ 
finish  time  on  the  cylinder  then  the  child  can  be  executing  the  same  instance  with  the  node 
N.  Therefore,  the  child  is  given  the  same  index  as  the  node  N. 

The  original  graph  and  the  cylinder  mapping  is  given  to  the  algorithm  as  input  and  the 
node  index  assignments  are  output  to  a  file. 

C.  SYNCHRONIZATION  ARCS  CREATION 

After  the  cylinder  assignment  and  corresponding  node  indices  are  established.  The 
required  synchronization  arcs  for  a  given  RC  schedule  can  be  determined. 

A  synchronization  arc  is  a  logical  arc  which  enforces  a  given  execution  sequence.  For 
example,  in  Figure  4.1(c),  node  b  and  node  c  can  be  executed  after  node  a’s  completion, 
node  d  can  be  executed  after  node  b's  completion,  node  e  can  be  executed  after  node  c's 
completion,  node/ can  be  executed  after  node  d  and  node  e’s  completion  and  node  a  can 
be  executed  after  nodef  s  completion.  The  execution  sequence  of  all  nodes  except  node  a 

2.  Some  value  greater  than  one  would  be  acceptable,  although  the  parents  output  queue  size  will 
place  an  upper  bound  on  that  value. 

3.  Some  value  less  than  one  would  be  acceptable,  although  the  child  input  queue  size  will  place  an 
lower  bound  on  that  value. 
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is  satisfied  by  the  data  dependency.  To  control  the  execution  of  node  a,  we  need  an  arc  from 
node  f  which  will  trigger  node  a  after  it  completes  ->  execution.  This  trigger  is  called 
synchronization  arc.  These  synchronization  arcs  can  be  implemented  as  input  queues  or 
output  queues.The  synchronization  arcs  implemented  as  input  queues  are  called  logical 
synchronization  arcs,  and  the  synchronization  arcs  implemented  as  output  queues  are  called 
physical  synchronization  arcs. 

In  order  to  understand  what  logical  and  physical  synchronization  arcs  means,  let  us 
look  the  Figure  4. 1  again.  In  this  Figure,  after  restructuring  the  original  graph  we  obtain  an 
synchronization  arc  from  node  f  to  node  a  with  two  initial  tokens.  It  simply  prevents  the 
node  a  s  third  execution  until  node  f  finishes  its  first  execution.  This  arc  can  be 
implemented  as  an  input  token  queue  which  has  an  initial  number  of  tokens,  a  threshold, 
and  a  consumption  and  production  amount.  During  the  execution,  these  tokens  are 
produced  and  consumed  according  to  the  sink  and  source  node  execution. 

Enforcing  the  RC  schedule  with  logical  arcs  is  called  Start  After  Finish  (SAF) 
approach.  It  can  be  easily  seen  from  the  Figure  4.1e.  node  a  can  not  start  its  third  execution 
until  node  b  finishes  its  first  execution. 

The  same  synchronization  arc  can  also  be  enforced  by  putting  a  physical  arc.  This 
physical  arc  can  be  implemented  as  an  output  token  queue.  In  this  approach,  execution 
sequence  is  controlled  by  the  queue  capacity  instead  of  queue  threshold  quantity.  Figure  4.5 
depicts  the  physical  synchronization  arc  and  the  logical  synchronization  arc  which 
correspond  to  the  same  synchronization  arc. 


Enforcing  the  RC  schedule  with  physical  arcs  is  called  Start  After  Start  (SAS) 
approach.  It  can  be  easily  seen  from  Figure  4.4  (b)  that  node  a  can  not  start  its  third 
execution  until  node  f  starts  its  first  execution.The  algorithm  for  SAF  and  SAS  approaches 
are  given  in  Figure  4.6  and  Figure  4.7  respectively. 


procedure  Restructure_graph_with_SAF  (Cylinder); 

/*nr,ns  are  nodes  of  graph,  G*/ 
for  all  nodes,  nr 
check  index  i  of  nr 
find  the  latest  node,  ns  that  ends 
before  nr  starts  on  the  cylinder 
check  index  j  of  ns 

/*if  nr  starts  at  the  top  of  the  cylinder,  the  latest*/ 
/*node  ends  at  the  bottom  of  the  cylinder.*/ 

/*In  this  case,  j  should  be  decremental  by  one*/ 

introduce  a  synchronization  arc  from  ns  to  nr 
if  i  >  j 

put  i  -  j  initial  tokens  on  the  arc 
set  threshold  =  1,  consume  =  1 
else  if  i  <  j 

put  0  initial  tokens  on  the  arc 
set  threshold  =  j-i+1,  consume  =  l 
end  (for). 

igure  4.6.  Algorithm  to  generate  logical  synchronization  arcs. 
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procedure  Restructure_graph_with_SAS  (Cylinder); 

/*nr,ns  are  nodes  of  graph,  G*/ 
for  all  nodes,  nr 
check  index  i  of  nr 
find  the  latest  node,  ns  that  starts 
before  nr  starts  on  the  cylinder 
check  index  j  of  ns 

/*if  nr  starts  at  the  top  of  the  cylinder,  the  latest*/ 

/*node  starts  at  the  bottom  of  the  cylinder.*/ 

/*In  this  case,  j  should  be  decremented  by  one*/ 

introduce  a  synchronization  arc  from  ns  to  nr 
if  i  >  j 

put  i  -  j  initial  tokens  on  the  arc 
set  threshold  =  1 ,  consume  =  1 
else  if  i  <  j 

put  0  initial  tokens  on  the  arc 
set  capacity  =  j-i+1,  consume  =  1,  threshold  =  1 
end  (for). 

ngure  4.7  Algorithm  to  generate  physical  synchronization  arcs. 

D.  PERFORMANCE  EVALUATION 

In  our  performance  evaluation  experiments,  we  use  two  sample  graphs  one  is  a  graph 
which  is  used  for  active  sonobuoys  and  the  other  is  an  artificial  test  graph  with  a  controlled 
topology.  The  benchmark  graph  consists  of  50  nodes  and  139  data  queues.  It  is  depicted  in 
Figure  4.8.The  small  test  graph  consists  of  25  nodes  and  27  data  queues.  It  is  depicted  in 
Figure  4.9. 

In  the  experiments,  in  order  to  minimize  the  effect  of  the  communication  overhead,  the 
backward  synhronization  arcs  from  lower  nodes  to  the  upper  nodes,  are  removed 
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Figure  4.8.  Active  sonobuoy  benchmark  graph. 
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A1  execution  times  are  given  in  cycles. 
Figure  4.9.  A  artificial  test  graph. 
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1.  Basic  Performance  Measurements 

In  this  experiment,  we  examine  the  performance  of  SAS,  SAF,  and  FCFS 
scheduling  techniques  running  on  different  numbers  of  processors  and  arbitrary  queue  to 
memory  mapping.  The  experiment  is  repeated  twice,  assuming  high  and  low 
communication  overheads.  Low  and  high  communication  overheads  correspond  to  0.15 
and  0.75  percent  communication-computation  ratio  respectively.  In  order  to  chose  a  data 
rate,  first  FCFS  scheduling  is  simulated  and  the  data  rate  which  saturates  the  machine  is 
chosen.  The  cylinder  then  is  mapped  at  this  data  rate.  The  results  demonstrate  that  FCFS 
has  a  high  throughput  in  most  cases. 

Figure  4.10  and  figure  4.11  depict  the  throughput  results  from  the  benchmark 
graph  and  artificial  test  graph  respectively.  In  Figure  4. 10,  SAF  scheduling  exhibits  slightly 
better  throughput  than  FCFS  and  SAS  scheduling  techniques. 


Figure  4. 10.  The  throughput  obtained  from  the  benchmark  graph. 
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Figure  4. 1 1  The  throughput  obtain  from  the  artificial  test  graph. 


In  Figure  4.11,  for  high  communication  overheads,  SAS  demonstrates  better 


throughput  in  most  cases,  but  for  low  communication  SAS  and  FCFS  are  exhibiting  almost 


same  throughput. 
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Figure  4.12  shows  the  processor  utilization  of  benchmark  graph.  The  results 
demonstrate  The  FCFS  scheduling  has  slightly  better  utilization  than  SAS  and  SAF 
scheduling  techniques  in  all  cases. 

Figure  4.13  depicts  the  coefficient  of  variation  (COV)  of  the  iteration  length 
distribution.  This  figure  measures  the  regularity  of  the  output  production  which  is  at  most 
importance  in  real-time  systems.  As  revealed  by  this  figure,  none  of  the  scheduling 
technique  is  conclusively  better  than  the  others  with  respect  to  COV. 


Figure  4.13.  The  Coefficient  of  Variation  obtained  from  benchmark  graph. 


2.  Evaluating  the  Effect  of  Memory  Mapping 

The  second  experiment  is  performed  to  see  if  mapping  the  queues  to  memory 
modules  to  minimize  the  access  conflicts  will  give  better  results  than  the  randomly  mapped 
memory  modules.  The  mapping  is  done  maually  by  inspecting  th  RC  schedule.The 
experiment  is  performed  assuming  30  percent  communication  overheads.  In  the 
benchmark  graph,  10  memory  modules  and  5  processors  are  used,  and  the  obtained  results 
are  depicted  in  Figure  4.14.  In  the  artificial  test  graph, 10  memory  modules  and  3  processors 
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are  used,  and  the  obtained  results  are  shown  in  Figure  4.15.  As  revealed  by  the  figures, 
mapping  the  queues  to  memories  according  to  the  cylinder  assignment  gives  better 
throughput  than  the  random  mapped  memories.(The  relative  change  in  throughput  is 


minuscale,  however). 


Figure  4. 14.  The  effect  of  the  memory  mapping  on  scheduling  techniques  (Benchmark) 


Graph  Name:  Artificial  Test  graph 
Number  of  Memory  Modules:  10 
Number  of  Processors:  4 
Communication:  30  percent 
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3.  Effect  of  the  Queue  Size  on  the  System  Throughput 

This  experiment  is  performed  to  watch  the  effect  of  changing  the  data  queue  sizes 
with  respect  to  FCFS,  and  compare  that  the  performance  of  SAS  and  SAF  assuming  the 
queue  sizes  dictated  by  the  RC  assignment.  The  experiment  was  performed  on  the  artificial 
test  graph  with  4  processors  and  10  memory  modules.  The  data  rate  which  saturates  the 
machine  in  FCFS  assuming  very  large  queue  sizes,  is  chosen.  Different  queue  sizes  were 
tried.  The  results  are  demonstrated  in  Figure  4.16.  For  the  given  data  rate,  the  experiment 
results  reveal  that  queue  size  does  not  effect  the  throughput  for  FCFS. 


Graph  Name:  Artificial  Test  graph 
Number  of  Memory  Modules:  10 
Number  of  Processors:  4 
Communication:  30  percent 
Normalized  data  rate:  0.90 


FCFS 
RC  (SAS) 
RC  (SAF) 


Queue  Size  ■  1  *  Data  size  Queue  Size  *2*  Dau  Size 
5  4.76  4.76 


Qoeae  Size  »  4  •  Dau  Size 

4.76  4.74  476 


Figure  4.16  The  effect  of  the  data  queue  size  on  throughput  with  FCFS, SAS  and  SAF 
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V.  CONCLUSION  REMARKS 


A.  CONCLUSIONS 

In  this  thesis,  a  simulator  (PIPDAFS)  which  is  used  for  Large  Grain  Data  Flow 
machines,  and  a  graph  restructurer  (GR)  which  determines  the  node  dependencies 
according  to  a  given  strategy  and  restructures  the  original  graph  were  developed.  A  series 
of  experiments  were  performed  by  using  GR  and  PIPDAFS.  GR  and  PIPDAFS  are  a  kind 
of  tool  that  can  be  used  to  test  the  different  scheduling  strategies. 

In  our  experiments,  we  compared  FCFS  scheduling  with  two  other  scheduling 
techniques  in  which  the  graphs  were  restructured  and  the  nodes  were  executed  according 
to  their  graph  topologies.  While  in  theory,  restructuring  techniques  should  improve  the 
performance,  the  result  of  the  experiments  demonstrate  that  this  improvement  is  meager. 
We  believe  that  the  reason  for  these  results  is  that  the  mapping  techniques  employed  in 
these  experiments  completely  ignored  the  effects  of  communication.  We  believe  that 
combining  the  restructuring  techniques  with  a  mapping,  while  considering  communication 
overhead,  would  give  results  closer  to  the  anticipated  theoretical  results. 

B.  FUTURE  WORK 

As  noted  from  the  experimental  results,  a  new  mapping  algorithm  which  considers  the 
communication  overhead  should  be  designed. 

Algorithms  for  allowing  the  complete  communication-computation  overlap  and 
deterministic  nonconflicting  resource  usage  should  be  identified. 

Currendy  synchronization  arcs  that  are  obtained  through  restructuring  are  manually 
pruned.  Some  work  is  needed  in  the  area  of  determining  the  minimal  number  of 
synchronization  arcs  required  to  satisfy  the  graph  restructuring,  as  the  number  of 
synchronization  arcs  can  be  potentially  be  a  major  factor  in  the  overall  system  performance. 

In  this  research,  a  dynamic  node  to  processor  assignment  technique  was  employed. 
Using  a  static  processor  assignment  technique  could  be  a  step  towards  achieving  more  run 
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-time  determinism.  This  should  be  critically  weighed  against  the  utility  of  fully  dynamic 
processor  assignment  techniques  with  respect  to  fault  tolerance. 
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APPENDIX  A  :  Sample  Graph  File 
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APPENDIX  B:  Simulator  Source  Code 


//  Author :  Cem  Akin 
//  Advisor  :  Amr  Zaky 

//  Description  :  LGDF  machine  simulator  class  header  file 
//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 

#include"scheduler.h" 

#include”mlist.h” 

#include  <fstream.h> 

class  simulator  { 
public  : 

simulator(); 

~simulator(); 
void  simulate(); 

void  produce_readqueues(gnode*,  event*); 
void  produce_writequeues(gnode*  .event*); 
void  initialize_events(int); 
void  process_event(event*,fstream&,fstream&.int); 
void  consume_q(int); 

voidprintstatistics(fstream&,fstream&,intjnt,double.double); 

void  increasewriteamount(gnode  *); 

void  increasereadamount(gnode  *); 

void  decreasewriteamount(gnode  *); 

void  decreasereadamount(gnode  *): 

boolean  assign_processor(gnode  *); 

boolean  arealloutputqsready(gnode  *); 

boolean  areallqsot(gnode  *); 

boolean  arealloutputnodesexecuted(); 

void  initializeoutputnodes(); 

private  : 

scheduler*  sch; 
mlist  *  ml; 
plist*  pi; 

Qlist  *  gqueuelisl; 
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nlist  *  gnodelist; 
pqueue  *  eventqueue; 
int  startinstance. 
finishinstance. 
inststart. 
fixedcomm; 

boolean  onlyonecanbeready, 

incremented; 

double  clock, 

interval, 

communication, 

comm; 


//  Author :  Cem  Akin 
//  Advisor :  Amr  Zaky 

//  Description  :  LGDF  machine  simulator  class. 
//  Date  ;  12  November  1992 
//  Last  Revised :  03  January  1993 
#include<iostream.h> 

#include”simulator.h” 

#include<math.h> 

//  Constructor  of  simulator  class. 
simulator::simulator()( 
clock  =  0; 

sch  =  new  scheduler 
ml  =  new  mlist; 
pi  =  new  plist; 
startinstance  =  0; 
finishinstance  =  0; 
incremented  »  false; 
gqueuelist  =  new  Qlist; 
gnodelist  =  new  nlist; 
eventqueue  =  new  pqueue; 
communication  =  0.0; 
comm  =1; 
fixedcomm  =100; 

1; 

//  Destructor  of  simulator  class 
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simulator:  :~simulator()  { 
delete  sch: 
delete  ml: 
delete  pi; 
delete  gqueuelist; 
delete  gnodelist; 
delete  eventqueue; 
sch  =  NULL; 
ml  =  NULL: 
pi  =  NULL: 
gqueuelist  =  NULL; 
gnodelist  =  NULL; 
eventqueue  =  NULL: 
I: 


//  This  function  initialize  the  simulator  to  start  the  simulation, 
void  simulator::initialize_events(int  period)! 
nlist*  l=gnodelist; 
event*  tempevent; 

//  For  each  input  node  produces  an  event  that  activates  the  simulator 
for(l->cuiTent=l->head;l->current;l->cuiTent=l->current->nextitem)( 
if  (l->current->element->ntype==input)  1 
l->current->element->dataperiod  =  period; 
tempevent  =  new  event; 

tempevent->eventname=reach_production_period: 
tempevent->starttime=c  lock; 
tempevent->priority  =clock, 
tempevent->nodenum  =l->current->element->nodeid; 
eventqueue->enqueue(tempevent); 


I: 


//  This  function  reads  the  input  files,  gets  user  choices,  initializes  the 
//simulator  and 
void  simulator::simulate(){ 
fstream  myfde, 
startfde, 
endfile. 
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userdata, 

inputperiod. 

config, 

mem. 

logfile; 

myfile.open("graph.dat",ios::in); 

config.open("machine.config",ios::in); 

mem.open(“memodules”,ios::in); 

int  index. 

nummemos. 

choice, 

option, 

option2, 

instancenum, 

dperiod; 

double  drate; 

intnode*  tnode; 

event*  tempevent; 

startfile.open(“starttimes”,ios:  :out); 

endfile.open(“endtimes’’40s::out); 

tempevent  =  new  event: 

userdata.open(“userdat”,ios:  :in); 

inputperiod.open(“inperiod",ios::in); 

userdata  » inststart; 

userdata  » instancenum: 

userdata  »  option; 

if  (option  ==  1) 

onlyonecanbeready  =  true; 
else 

onlyonecanbeready  =  false; 
userdata  »  choice; 
userdata  »  option2; 
if  (option2)( 

logfile.open(“log_ftle"aos:;applios::out); 

I: 


nummemos=ml->loadmemory(config); 
int  pno; 
config  »pno; 

int  pnum  =  pl->loadpiocessors(config,pno); 
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for  (index=l:index<=pno;index++)( 
tnode  =  new  intnode; 
tnode->setnode(index,0); 
sch->ffeeproclist->enqueue(tnode); 


int  tottime  =  gnodelist->loadnodes(myfile.mem.choice,nummemos); 

gqueuelist->loadqueues(myfile,mem.gnodelist,nummemos): 

rnyfllexIoseO; 

inputperiod  »  dpericxl: 

initialize_events(dperiod); 

drate  =  double  (double! tottime  /  pnum)/dperiod); 

cerr « ’^SIMULATION  IS  IN  PROCESS  PLEASE  WAIT  !!!!\n”; 

while  ((!  (eventqueue->emptyO))&&  (instancenum>finishinstance))  ( 
tempevent  =  eventqueue->dequeueO; 
process_event(tempevent,startfile,endfile,instancenum); 
if  (option2){ 

logfile  «  clock  «  “ 
tempevent->printevent(logftle); 

}; 

); 


startfile.closeO; 

endfile.closeO; 

inputperiod.closeO; 

userdata.close(); 

config.closeO; 

logfile.closeO; 

printstatistics(startfile,endftie4nstancenum-inststart+ 1 4nstancenum,drate,tottime); 

»: 


//  This  function  processes  the  given  event  and  enqueue  the  produced  events  to 
//  event  queue. 

void  simulator::process_event(event  *e  Jstream&  startfile. 
fstream&  endfile.int  instancenum){ 
event  *  tempevent=new  event; 
gnode  *  tempnode ; 

Queueltem  *  tempqueue; 
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memory  *  tempmemoiy; 
processor  *  tempproc; 
intnode  *mode  =new  intnode; 
int  duration; 

clock  =  e->starttime;  //Advance  the  system  clock  to  event  starttime 
switch  (e->eventname){ 

//if  input  data  period  has  been  reached 
case  reach_production_period:  ( 

tempnode=gnodelist->getnode(e->nodenum); 
tempevent->eventname=inqsoverthreshold;  //Produce  event  that 
tempevent->starttime=clock;  //indicates  input  queues 
tempevent->priority  =clock;  //are  overthreshold. 
tempevent->nodenum  =  e->nodenum;  //It  is  assumed  that  amount 
eventqueue->enqueue(tempevent):  //of  data  at  each  period  is 
//large  enough  to  make  input 
//queues  overthreshold 

tempevent  =  new  event;  //Produce  next  data  period  event 

tempevent->eventname=reach_production_period; 

tempevent->starttime=cIock+tempnode->getdataTate(); 

tempevent->priority  =  tempevent->starttime: 

tempevent->nodenum  =  e->nodenum: 

eventqueue->enqueue(tempevent); 

break; 

): 


// if  all  node  input  queues  are  overthreshold 
case  inqsoverthreshold:  { 

tempnode  »  gnodelist->getnode(e->nodenum); 
if  (arealloutputqsready(tempnode)litempnode->getnodetype()==output)  1 
tempevent->eventname  =  ready_node;//If  all  node  output  queues 
tempevent->starttime  =  clock;  //have  space  or  node  is  an 
tempevent->priority  =  clock;  //output  node, then  produce 
tempevent->nodenum  =  e->nodenum:  //node  is  ready  event 
eventqueue->enqueue(tempevent); 

} 

else  { 

if  (tempnode->ntype==instruction)| 

tempnode->waitingforoutput  =  true;//Otherwise  set  the  flag  to 
tempnode->waitingnumber-H-;  //indicate  node  is  waiting  for 
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1; 


break; 

): 


//if  node  is  ready  to  be  executed 
case  ready_node:  f 

tempnode  =  gnodelist->getnode(e->nodenum); 

/Af  node  can  not  be  put  into  the  ready  list  decrease  the  readamount 
//which  is  previously  incremented  to  prevent 
if  (onlyonecanbeready  &&  (sch->member(e->nodenum))) 
decreasereadamount(tempnode); 

//If  node  is  not  already  in  rl  then  put  node  to  rl  and  schedule  a 
//from  rl. 

if  (onlyonecanbeready  &&  (!(sch->member(e->nodenum)))){ 
sch->putnodeinrl(gnodelist->getnode(e->nodenum)); 
increasewriteamount(tempnode); 

I 

else{  //If  node  is  already  in  rl  increment  number  of  waiting  ready 
//nodes 

if  (onlyonecanbeready) 
tempnode->readycount++; 

else{  //  If  more  than  one  node  can  be  ready  at  any  time 
//put  node  to  rl.  and  schedule  a  node 
sch->putnodeinrl(gnodeiist->getnode(e->nodenum)); 
increase  writeamount(tempnode); 


tempevent  =  new  event; 

tempevent->eventname  =  schedule_a_node_from_rl; 
tempevent->starttime  =  clock; 
tempevent->priority  =  clock; 
eventqueue->enqueue(  tempevent); 
break; 

I; 

//If  rl  is  not  empty  schedule  a  node  from  rl. 
case  schedule_a_node_firom_rl:  { 
if  (!(sch->emptyrl())){ 
if  (!(sch->isbusy())) 

sch->schedule_node(gnodelist,pl,eventqueue,clock); 

}; 
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break; 


I: 

// If  node  setup  is  started 
case  start_setup :  ( 

tempnode=gnodelist->getnode(e->nodenum); 
tempproc  =  pl->getproc(e->assocproc); 
if  C(incremented)) 

if  (tempnode->getnodetype()=input&& 
startinstance<=tempnode->getnodeinstance())| 
incremented  =  true; 
startinstance++; 
if  (startinstance  >=  inststart) 

startfile  «  startinstance  «  “  “  «  clock  «endl; 
if  (startinstance  ==  inststart) 
interval  =  clock; 


if  ((tempnode->getsetuptimeO>=0)){ 

if  ((!(tempproc->isbreakdownbusy()))&& 
( !  (tempproc ->isexecbusy()»)  { 
tempproc->startutil=clock; 

1: 


tempproc=pl->getproc(e->assocproc); 

if  (( !  (tempproc ->isbreakdownbusy()))ll 

tempnode->getsetuptime()=0){  //  If  breakdown  is  not  busy 

tempproc->setsetupbusytill(clock+tempnode->getsetupttnie()); 

tempproc->setsetupbusy(true); 

tempevent  =  new  event; 

tempproc ->waitingonexec  =  false; 

incremented  =  false; 

tempevent->eventname  =  fmish_setup; 

tempevent->starttime  =  c!ock+tempnode->getsetuptimeO; 

tempevent->priority  =  tempevent->starttime; 

tempevent->nodenum  =  e->nodenum; 

tempevent->queuenum  *  e->queuenum; 

tempevent->assocproc  =  e->assocproc; 

tempevent->assocmem  *  e->assocmem; 

eventqueue->enqueue(  tempevent); 
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else}  //  Otherwise  start  to  read  after  setup  finished 
tempevent  =  new  event: 
tempevent->eventname  =  start_setup; 
tempevent->starttime  =  tempproc->getbreakdownbusytime(); 
tempevent->priority  =  tempevent->starttime; 
tempevent->nodenum  =  e->nodenum; 
tempevent->assocproc  =  e->assocproc: 
tempevent->assocmem  =  tempnode->memid: 
eventqueue->enqueue(tempevent); 


if  ((!(tempproc->isbreakdownbusyO))&& 

( !  (tempproc->  isexecbusyO))) } 
tempproc->startutil=clock; 

>; 

tempproc=pl->getproc(e->assocproc): 

if  (!(tempproc->isbreakdownbusy())){  //  If  breakdown  is  not  busy 
tempevent  =  new  event:  //start  to  read  primitive  inst 
tempproc->setsetupbusytill(clock+tempnode->getsetuptime()); 
tempproc  ->setsetupbusy(true); 
tempproc->waitingonexec  =  false: 
incremented  =  false: 

ternpevent->evenmame  =  start_reading_instmction_stream: 
tempevent->starttime  =  clock: 
tempevent->priority  =  clock; 
tempevent->nodenum  =  e->nodenum; 
tempevent->assocproc  =  e->assocproc; 
tempevent->assocmem  =  tempnode->memid: 
eventqueue->enqueue(  tempevent): 

} 

else}  //Otherwise  start  to  read  after  setup  finished 
tempevent  =  new  event; 

tempevent->eventname  =  start_reading_instruction_stream: 
tempevent->starttime  =  tern  pproc->getbreakdownbusy  time}): 
tempevent->priority  =  tempevent->starttime; 
tempevent->nodenum  =  e->nodenum; 
tempevent->assocproc  =  e->assocproc; 
tempevent->assocmem  =  tempnode->memid; 
eventqueue->enqueue(tempevent); 
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}; 

break; 


//Start  to  read  primitive  instruction 
case  start_reading_instruction_stream;  { 
tempevent  =  new  event; 
tempmemory  =  ml->getmem(e->assocmem): 
tempproc  =  pl->getproc(e->assocproc); 
tempnode  =  gnodelist->getnode(e->nodenum); 
if  (!(tempmemory->isbusy())){  //  if  memory  is  not  busy 
//Calculate  reading  delay  and  finish  reading  after  this  delay 
duration  =  (int  (fixedcomm  +  comm*tempnode->aissize)); 
tempproc ->setsetupbusytill(clock+duration); 
tempmemory->setbusy(tnie); 
tempmemory->setbusytill(clock+duration); 
tempevent->eventname  =  finish_reading_instruction_stream; 
tempevent->starttime  =  clock+duration; 
tempevent->priority  =  clock+duration; 
tempevent->nodenum  =  e->nodenum; 
tempevent->assocproc  =  e->assocproc; 
tempevent->assocmem  =  e->assocmem; 
eventqueue->enqueue(tempevent); 

1 

else  |  //otherwise  try  to  read  after  memory  become  available 
tempproc->setsetupbusytill(tempmemory->getbusytime()); 
tempevent  =  new  event; 

tempevent->eventname  =  start_reading_instruction_stream; 
tempevent->starttime  =  tempmemory->getbusytime(); 
tempevent->priority  =  tempevent->starttime; 
tempevent->nodenum  =  e->nodenum; 
tempevent->assocproc  =  e->assocproc; 
tempevent->assocmem  =  e->assocmem; 
eventqueue->enqueue(tempevent); 

}; 

break; 

I; 


//finishes  the  reading  of  primitive  instructions 
case  finish_reading_instruction_stream :  ( 

tempmemory  =  ml->getmem(e*>assocmem); 


57 


tempnode  =  gnodelist->getnode(e->nodenum); 
tempmemory->setbusy(false); 

//Produce  read  queues  for  output  queues 
produce_readqueues(tempnode.e ); 
break; 
l; 


//  Starts  to  read  specified  input  queue  if  processor  is  not  already 
//reading  and  memory  to  be  read  from  is  not  busy 
case  start_readqueue:( 

tempqueue  =  gqueuelist->getqueue(e->queuenum); 

tempmemory=  ml->getmem(tempqueue->getgmid()); 

tempproc=pl->getproc(e->assocproc); 

if  (tempqueue->qtype  ==  syn_arc)( 
tempevent  =  new  event; 
tempevent->eventname=finish_reading; 
tempevent->starttime=clock; 
tempevent->priority=tempevent->starttime; 
tempevent->queuenum=e->queuenum; 
tempevent->nodenum=e->nodenum; 
tempevent->assocproc=e->assocproc; 
tempevent->assocmem  =  tempqueue->getgmid(); 
eventqueue->enqueue(tempevent); 

I 

else{ 

if  (!(tempmemory->isbusyO»{ 
if  (!(tempproc->readinginprocess»{ 
tempproc->readinginprocess  =  true; 

duration  =(int(fixedcomm+comm*tempqueue->consurnptionqty)): 
tempproc->setsetupbusytill(clock+duration); 
tempmemory->setbusy(tiue); 
tempmemory->setbusytilI(clock+duration); 
tempevent->eventname=finish_reading; 
tempevent->starttime=clock+duration; 
tempevent->priority=tempevent->starttime; 
tempevent->queuenum=e->queuenum; 
tempevent->nodenum=e->nodenum; 
tempevent->assocproc=e->assocproc; 
tempevent->assocmem  =  tempmemory->getmemidO: 
e  ventqueue->enqueue(tem  pevent); 

} 
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else{  //if  processor  is  already  reading 

tempevent->eventname=start_readqueue; 

tempevent->starttime=tempproc->setupbusytill; 

tempevent->priority=tempevent->starttime; 

tempevent->nodenum=e->nodenum; 

tempevent->queuenum=e->queuenum; 

tempevent->assocproc=e->assocproc: 

eventqueue->enqueue(tempevent); 

); 


else{  //If  memory  to  be  read  is  busy 

tempproc->setsetupbusytill(tempmemory->getbusytime()); 
tempevent->eventname  =  e->eventname; 
tempevent->starttime  =  tempmemory->getbusytime(): 
tempevent->priority  =  tempevent->starttime; 
tempevent->queuenum  =  e->queuenum; 
tempevent->nodenum  =  e->nodenum; 
tempevent->assocproc  -  e->assocproc; 
tempevent->assocmem  =  tempmemory->getmemid(); 
eventqueue->enqueue(tempevent); 

1; 

I; 

break; 


//Mark  processor  as  not  busy  and  look  if  there  are  waiting  ready 
//nodes  for  a  processor, 
case  free_processor:  { 

tempproc  =  pl->getproc(e->assocproc); 
tempproc->setffee(true); 
tnode->setnode(  tempproc->proc  id.O): 
sch->ffeeproclist->enqueue(tnode); 
tempevent->eventname  =  schedule_a_node_from_rl; 
tempevent->starttime  =  clock; 
tempevent->priority  =  clock; 
eventqueue->enqueue(tempevent); 
break; 

I; 


//Starts  execution 


59 


case  start_execution :  { 

tempnode=gnodeUst->getnode(e->nodenum): 

//Get  the  execution  delay 

duration=tempnode->getexecti'ne(); 

tempproc=pl->getproc(e->assocproc); 

tempevent  =  new  event; 

tempevent->eventname  =  start_execution; 

tempevent->starttime  =  tempproc->getexecbusytime(): 

tempevent->priority  =  tempevent->starttime; 

tempevent->nodenum  =  e->nodenum; 

tempevent->assocproc  =  e->assocproc; 

eventqueue->enqueue(tempevent); 

break; 

I; 


//Finishes  execution  if  breakdown  and  setup  stages  are  not  busy 
case  finish_execution :  { 

tempnode=gnodelist->getnode(e:>noder  m); 
tempproc=pl->getproc(e->assocproc); 
if(!(tempproc->isbreakdownbusy())){ 
if  (!(tempproc->issetupbusy())){ 
tempproc->setexecbusy(false): 
tempproc->setbreakdownbusy(irue); 
tempproc->setbreakdownbusytiil(clock); 
tempnode->lastinstance++; 
tempevent->eventname  =  start_breakdown; 
tempevent->starttime  =  clock; 
tempevent->priority  =  clock; 
tempevent->nodenum  =  e->nodenum; 
tempevent->assocproc  =  e->assocproc; 
eventqueue->enqueue(tempevent); 

} 

else  (//Otherwise  indicate  that  execution  is  waiting  for  setup 
tempproc->waitingonsetup  =  true; 

if  (tempproc->waitingonexec)  { //This  part  prevents  the  deadlock 
tempproc->setsetupbusy(false); 
tern  pproc ->sete  xecbusy( false ); 
tempproc->waitingonsetup=false; 
tempevent->stantime=clock; 

1 
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else 

tempevent->starttime=  tempproc->getsetupbusytime(): 

tempproc->setexecbusytill(tempevent->starttime): 
tempevent->eventname  =  finish_execution; 
tempevent->priority  =  tempevent->starttime; 
tempevent->nodenam  =  e->nodenum: 
tempevenc->assocproc  =  e->assocproc: 
eventqueue->enqueue(tempevent); 

>: 

) 

else!  //If  breakdown  is  busy,  try  to  finish  execution 

tempproc->setexecbusytill(tempproc->getbreakdownbusytime()); 
tempevent->eventnanie  =  finish_execution; 
tempevent->starttime  =  tempproc->getbreakdownbusytime(); 
tempevent->priority  =  tempevent->staittime; 
tempevent->nodenum  =  e->nodenum; 
tempevent->assocproc  =  e->assocproc; 
eventqueue->enqueue(tempevent); 

I; 

break; 

I: 


//Starts  to  breakdown  and  produces  start  write  queue 
case  start_breakdown  :  { 

tempnode=gnodelist->getnode(e->nodenum); 

tempproc=pl->getproc(e->assocproc); 

tempproc->setbreakdownbusytill(clock+tempnode->getbreakdowntime()); 

tempproc->setbreakdownbusy(true); 

//Produces  write  queues 

produce_writequeues(tempnode,e): 

break; 


//Starts  writing  to  the  specified  GM  if  processor  is  not  already 
//writing  and  memory  to  be  written  is  not  busy 
case  start_write_queue:  { 

tempqueue=gqueuelist->getqueue(e->queuenum); 
tempproc=pl->getproc(e->assocproc); 
if  (tempqueue->qtype=syn_arc){ 
tempevent  =  new  event; 
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tempevent->eventname  =  fmish_writing; 
tempevent->starttime  =  clock: 
tempevent->priority  =  tempevent->starttime; 
tempevent->nodenum  =  e->nodenum: 
tempevent->queuenum  =  e->queuenum; 
tempevent->assocpioc  =  e->assocproc; 
tempevent->assocmem  =  tempqueue->getgmid(); 
eventqueue->enqueue(tempevent): 

I 

else (  • 

tempnode  =  gnodelist->getnode(e->nodenum); 
if  (tempnode->getbreakdowntime()>=0){ 

tempproc->setbreakdownbusytill(clock+tempnode->getbreakdowntime()); 
tempmemory=ml->getmem(tempqueue->getgmid()); 
tempevent->evemname  =  fmish_writing; 
tempevent->starttime  =  clock+tempnode->getbreakdownrime(): 
tempevent->priority  =  tempevent->starttime; 
tempevent->nodenum  =  e->nodenum; 
tempevent->queuenum  =  e->queuenum; 
tempevent->assocproc  =  e->assocproc; 
tempevent->assocmem  =  tempmemory->getmemid(); 
eventqueue->enqueue(tempevent): 

} 

else( 

//Calculate  the  write  process  delay 

duration  =(int(fixedcomm+comm*tempqueue->getproductionqty())); 
ternpmemory=ml->getmem(tempqueue->getgmid()); 
if  ((!(tempmemory->isbusy()))lltempnode->getnodetype()==input)  ( 
if  (!(tempproc->writeinprocess)){ 
tempproc->writeinprocess  =  true: 
tempproc->setbreakdownbusytill(clock+duration): 
if  (tempnode->getnodetype()==inst)| 
tempmemory->setbusy(true): 
tempmemcH7->setbusytill(duration+clock): 

I: 

tempevent->eventname  =  finish_writing; 
tempevent->starttime  =  clock+duration; 
tempevent->priority  =  tempevent->starttime: 
tempevent->nodenum  =  e->nodenum; 
tempevent->queuenum  =  e->queuenum: 
tempevent->assocproc  =  e->assocproc: 


tempevent->assocmem  =  tempqueue->getgmid(); 
eventqueue->enqueue(tempevent); 

} 

else{ 

tempevent->eventname  =  start_write_queue; 
tempevent->starttime  =  tempproc->breakdownbusytiil; 

tempevent->priority  =  tempevent->starttime: 
tempevent->nodenum  =  e->nodenum; 
tempevent->queuenujn  =  e->queuenum; 
tempevent->assocproc  =  e->assocproc; 
eventqueue->enqueue(tempevent); 

l; 

I 

else{ 

tempproc->setbreakdownbusytiU(tempmemory->getbusytiine()); 
tempevent->eventname  =  e->eventname; 
tempevent->starttime  =  tempmemory->getbusynme(); 
tempevent->priority  =  tempevent->stamime: 
tempnode  =  gnodelist->getnode(e->nodenum); 
tempevent->nodenum  =  e->nodenum; 
tempevent->queuenuin  =  e->queuenum; 
tempevent->assocproc  =  e->assocproc; 
tempevent->assocmem  =  tempqueue->getgmid(); 
eventqueue->enqueue(tempevent); 

I: 

I; 

1; 

break; 

}; 

//Finishes  reading 
case  Fmish_reading;  ( 

tempproc=pl->getproc(e->assocproc); 

tempproc->readinginprocess=faise; 

tempnode=gnodelist->getnode(e->nodenum); 

tempmemory=ml->getmem(e->assocmem); 

tempmemory->setbusy(false); 

tempnode->incnumreadqs(); 

//If  this  is  the  last  queue  to  be  read,  finish  setup 
if  (tempnode->lastnoderead())( 
tempnode->setnumreadqs(); 


consume_q(e->queuenum); 
tempevent->eventname  =  finish_setup; 
tempevent->starttime  =  clock; 
tempevent->priority  =  clock; 
tempevent->nodenum  =  e->nodenum; 
tempevent->queuenum  =  e->queuenum; 
tempevent->assocproc  =  e->assocproc; 
tempevent->assocmem  =  e->assocmem: 
eventqueue->enqueue(tempevent); 

I: 

break: 

1; 


//Finishes  the  node  setup  if  execution  stage  is  not  busy 
case  finish_setup:{ 

tempproc=pl->getproc(e->assocproc); 

tempnode=gnodelist->getnode(e->nodenum); 

if(!(tempproc->isexecbusy())){ 

//Free  the  processor 
tempproc->setsetupbusy(false); 
tempevent->eventname=free  _processor; 
tempevent->starttime=clock; 
tempevent->priority  =clock; 
tempevent->assocproc=e->assocproc; 
eventqueue->enqueue(tempevent); 
tempproc->onlyexectime  =  tempproc->onlyexectime  + 
tempnode->getexectune(): 

tempevent  =  new  event; 
tempevent->eventnaine=start_execution; 
tempevent->starttime  =  clock; 
tempevent->priority  =  clock; 
tempevent->nodenum  =  e->nodenum; 
tempevent->assocproc  =  e->assocproc; 
eventqueue->enqueue(  tempevent); 

} 

else(//  Execution  stage  is  busy  try  to  finish  setup 
//When  execution  is  finished 
tempproc->waitingonexec  =  true; 

if  {tempproc->waitingonsetup)  |  //This  part  prevents  deadlock 
tempproc  >setexecbusy(  false): 
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tempproc->waitingonsetup=false; 

tempproc->waitingonexec=false: 

tempevent->starttime=clock: 

} 

else 

tempevent->starttime=tempproc->getexecbusytime(); 

tempproc->setsetupbusytill(tempevent->starttiine); 
tempevent->eventname  =  finish_setup: 
tempevent->priority  =  tempevent->starttime; 
tempevent->nodenum  =  e->nodenum; 
tempevent->assocproc  =  e->assocproc; 
eventqueue->enqueue(tempevent); 

); 

break: 

1: 

//Finishes  writing  process 
case  finish_writing:  j 

tempproc=pl->getproc(e->assocproc); 

tempproc->writeinprocess=false; 

tempqueue=gqueuelist->getqueue(e->queuenum); 

tempnode  =gnodelist->getnode(e->nodenum); 

tempnode->incnumwriteqsO: 

tempmemory=ml->getmem(e->assocmem); 

tempmemory->setbusy(false); 

/Ancrement  current  length  of  queue 

tempqueue->inccurrentlength(tempqueue->getproductionqty()): 
//Decrement  writeamount  since  queue  is  physically  written 
tempqueue->writeamount=tempqueue->writeamount- 
tempqueue->getproductionqty(); 

/A f  this  is  the  last  queue  to  be  written  finish  breakdown 
if  (tempnode->lastqueuewrite()){ 
tempnode  ->setr  um  writeqs(); 
tempevent->eventname=finish_breakdown: 
tempevent->starnime  =  clock: 
tempevent->priority  =  clock; 
tempevent->nodenum  =  e->nodenum; 
tempevent->queuenum  =  e->queuenum; 
tempevent->assocproc  =  e->assocproc: 


tempevent->assocmem  =  e->assocmem; 
eventqueue->enqueue(tempevent); 

»: 

//If  produeced  data  makes  the  queue  overthreshold 
if  (tempqueue->isoverthreshold())( 
tempevent  =  new  event: 
tempevent->eventnaine=queue_ovenhreshold; 
tempevent->starttime=clock; 
tempevent->priority=clock; 
tempevent->nodenum=tempqueue->getsinknode(); 
tempevent->queuenum=e->queuenum; 
eventqueue->enqueue(tempevent); 

I: 

break; 


//Finishes  the  node  breakdown 
case  Finish_breakdown:  ( 

tempnode  =  gnodelist->getnode(e->nodenum); 
tempproc=pl->getproc(e->assocproc); 
tempproc->setbreakdownbusy(false); 
if  (((!(tempproc->issetupbusy()))&& 
(!(tempproc->isexecbusyO)))H 
finishinstance=(  instancenum- 1 ))  { 
tempproc->durationprocbusy=tempproc->durationprocbusy+ 
clock-tempproc->startutil; 
tempproc->startutil=clock; 

I; 

tempevent  =  new  event; 
tempnode=gnodelist->getnode(e->nodenum): 
tempnode->processing  =  false; 
tempevent  =  new  event; 
if  (tempnode->getnodetype()==output) 
tempnode->executed  =  true; 
if(arealloutputnodesexecuted())  | 
initializeoutputnodesO; 
finishinstance-H-; 
if  (fmishinstance  >=  iuststart) 
endfile  «  fmishinstance  «  “  “  «  clock  «  endl; 

I; 
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//If  ready  node  list  is  not  empty 
if  (!(sch->emptyrl()))( 

tempevent->eventname  =  schedule_a_node_from_rl; 
tempevent->starttime  =  clock; 
tempevent->priority  =  clock; 
eventqueue->enqueue(tempevent); 

I: 

break; 

): 


//  If  queue  is  overthreshold  check  the  other  input  queues 
//If  all  overthreshold  indicate  node’s  input  data  is  ready 
case  queue_oveithreshold:  { 

tempnode=gnodelist->getnode(e->nodenum); 
if  (areallqsot(tempnode))| 

increasereadamount(tempnode); 
tempevent->eventname=inqsoverthreshold; 
tempevent->starttime=clock: 
tempevent->priority  =clock; 
tempevent->nodenum  =  e->nodenum; 
eventqueue->enqueue(tempevent); 

I; 

break; 


//Marks  scheduler  as  not  busy 
case  free_scheduler:  { 
sch->setbusy(false); 

//Checks  if  there  is  a  ready  node  which  is  waiting  for 
//scheduler  try  to  schedule  a  node  from  rl 
tempevent->eventname  =  schedule_a_node_from_rl; 
tempevent->starttime  =  clock; 
tempevent->priority  =  tempevent->starttime: 
eventqueue->enqueue(tempevent); 
break; 

}; 

case  none:  ( 
break; 

I; 

I; 
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| ;  //end  of  procesevent 


//  Produces  read  queue  events  for  each  input  if  it  is  not  an  input  node, 
void  simulator: :produce_readqueues(gnode  *n,event*  e){ 
intlist  *1; 

event  *  tempevent ; 

1  =  n->getinputqueuelist(); 

//If  it  is  an  input  queue,  then  after  fixed  amount  delay  finish  setup 
if  (n->getnodetype()  =  input)  { 
tempevent=new  event; 
tempevent->eventname  =  finish_setup; 
tempevent->starttime  =  clock; 
tempevent->priority  =  tempevent->starttime; 
tempevent->nodenum  =  e->nodenum; 
tempevent->assocproc  =  e->assocproc; 
eventqueue->enqueue(  tempevent); 

I 

else{ 

//For  internal  nodes  produce  read  queues 

for(l->current=l->head;l->curTent;l'>curTent=l->cuiTent->nexmode){ 
tempevent=new  event; 
tempevent->eventname  =  startjreadqueue; 
tempevent->starttime  =  clock; 
tempevent->priority  =  clock; 
tempevent->nodenum  =  n->getnodeid(); 
tempevent->queuenum  =  l->curTent->number; 
tempevent->assocproc  =  e->assocproc; 
eventqueue->enqueue(  tempevent); 

I; 

I; 


//Produces  write  queue  events  for  each  output  queue  if  node  is  not  an 
//output  node 

void  simulator::produce_writequeues(gnode  ‘n.event*  e){ 
intlist  *1; 

event  *  tempevent; 

1  =  n->getoutputqueuelist(); 

//If  node  is  an  output  node  finish  breakdown  after  fixed  amount  of  delay 
if  (n->getnodetype()  =  output)  { 
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tempevent=new  event; 
tempevent->eventname  =  finish_breakdown; 
tempevent->starttime  =  clock; 
tempevent->priority  =  tempevent->starttime; 
tempevent->nodenum  =  e->nodenum; 
tempevent->assocproc  =  e->assocproc; 
eventqueue->enqueue(tempevent); 
l 

else( 

//If  node  is  an  internal  node  produce  write  queue  event  for  each  output 
//queue 

for(l->current=l->head;l->current:l->current=l->current->nextnode){ 
tempevent  =  new  event; 
tempevent->eventname  =  start_write_queue; 
tempevent->starttime  =  clock; 
tempevent->prioriiy  =  clock; 
tempevent->nodenum  =  n->getnodeid(); 
tempevent->queuenum  =  l->current->nuniber; 
tempevent->assocptoc  =  e->assocproc; 
eventqueue->enqueue(tempevent); 

}; 

); 

1; 

//Consumes  the  input  queues  by  decreasing  the  threshold  amount 
void  simulator: :consume_q(int  num){ 
intlist  *1; 

Queueltem  *temp; 
boolean  flag; 
gnode  *n  *m; 

temp=gqueuelist->getqueue(l->current->number); 

n=gnodelist->getnode(temp->nodeout); 

flag  =  temp->isthereenoughspace(); 

temp->cuiTentlength=temp->cuiTentlength-temp->getthreshold(); 

temp->readamount=temp->readamount-temp->getthresholdO; 

//If  consumption  makes  room  for  the  waiting  nodes 
if  (!flag)f 

//get  source  node 

m  =  gnodelist->getnode(temp->nodein); 
if  (temp->isthereenoughspace())  ( 

//If  all  output  qs  are  ready  and  if  source  node  is  waiting 


//for  the  output  become  ready 
if  (arealloutputqsready(m)&&  m->waitingforoutput)( 
m->waitingnumber-; 
if  (m->waitingnumber  =  0) 
m->waitingforoutput  =  false; 
tempevent  =  new  event; 
tempevent->evenmame=ready_node: 
tempevent->starttime=clock: 
tempe  vent->priority=c  lock; 
tempevent->nodenum=m->getnodeid(); 
eventqueue->enqueue(  tempevent): 


I; 


//If  sink  node  still  overthreshold  and  it  is  not  an  input  node 
//indicates  that  input  queues  are  still  overthreshoid 
if  (areallqsot(n)&&(n->getnodetype()  !=input»  I 
increasereadamount(n); 
tempevent  =  new  event; 
tempevent->eventname=inqsoverthreshold; 
tempevent->starttime=clock; 
tempevent->priority=clock; 
tempevent->nodenum=n->getnodeid(); 
eventqueue->enqueue(tempevent); 

h 


//Returns  boolean  if  all  input  queues  are  overthreshoid 
boolean  simulaton:areallqsot(gnode  *n){ 

Queueltem*  q; 

indist*  ql  =  n->getinputqueuelist(); 

for  (ql->current=ql->head;ql->current;ql->current=ql->current->nextnode)  ( 
q  =  gqueuelist->getqueue(ql->current->number); 
if  ((q->currentlength-q->readamount)<  q->thresholdqty) 
return  false; 

J; 

return  true; 


//Increases  the  read  amount  of  input  nodes.  It  is  used  to  prevent,  input  queues 
//becomeoverthreshold  even  it  is  put  in  to  ready  list 
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void  simulator::increasereadamount(gnode  *n){ 

Queueltem*  q; 

intlist*  ql  =  n->getinputqueuelist(); 

for  (ql->current=ql->head:ql->current:ql->current=ql->current->nextnode)i 
q  =  gqueuelist->getqueue(ql->current->number); 
q->readamount  =  q->readamount  +q->consumptionqty; 


//Decreases  the  read  amount  of  input  queues 
void  simuIator::decreasereadamount(gnode  *n){ 

Queueltem*  q; 

intlist*  ql  =  n->getinputqueuelist(); 

for  (ql->current=ql->head:ql->current:ql->current=ql->current->nextnodeM 
q  =  gqueuelist->getqueue(ql->current->number); 
q->readamount  =  q->readamount  -q->consumptionqty; 

}; 


//Increases  the  write  amount  of  output  queues  to  prevent  putting  an  extra  ready 

//node  which  will  cause  overcapacity 

void  simulator::increasewriteamount(gnode  *n)| 

Queueltem*  q; 

intlist*  ql  =  n->getoutputqueuelist(): 

for  (ql->current=ql->head;ql->current:ql->current=ql->curTent->nextnode){ 
q  =  gqueuelist->getqueue(ql->current->number); 
q->writeamount  =  q->writeamount  +q->productionqty; 


//Returns  true  if  all  output  queues  has  enough  space  to  store  the  result  of 
//source  node  execution 

boolean  simulator:arealloutputqsready(gnode  *n){ 

Queueltem*  q; 

intlist*  ql  =  n->getoutputqueuelist(); 

for  (ql->current=ql->head;ql->current;ql->current=ql->curTent->nextnode)  ( 
q  =  gqueuelist->getqueue(ql->current->number): 
if  ((q->capacity  -  q->currentlength-q->writeamount)<  q->productionqty) 
return  false; 

I: 

return  true; 


71 


}; 

//Return  true  if  all  output  nodes  of  an  instance  is  executed  or  not 
boolean  simulator :arealloutputnodesexecuted()| 
fort  gnodelist->current=  gnodelist->head: 
gnodelist->current; 

gnodelist->current=gnodelist->cun'ent->nexutem) 
if  (gnodelist->current->element->getnodetype()==outpiit) 
if  (gnodelist->cun'ent->element->executed=faIse) 
return  false; 
return  true; 

1: 

//Initialize  the  output  queues  execution  flag 
void  simulator;;  initializeoutputnodes()  { 

for(gnodelist->cuiTent=  gnodelist->head; 
gnodelist->cuiTent; 

gnodelist->current=gnodelist->current->nextitem) 
if  (gnodelist->current->element->getnodetype()==output){ 
gnodelist->current->element->exectimes-; 
if  (gnodelist->current->element->exectimes==0) 
gnodelist->current->element->executed=false; 


//Prints  the  statistical  information 

void  simuIator::printstatistics(fstream&  startfile,fstream&  endfile. 

int  instnumjnt  endinst.double  drate,  double  tottime){ 

double  end, 
start, 

difference, 

sum, 

squaresum, 

average, 

variance; 

double  averageexectime  =0.0; 
double  averagebusytime  =0.0; 
fstream  aputil,thrput,lenvar,instlen,onlyexec; 
int  cnt=0; 
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COUt  «  “*************** *************  j*****************************^”. 

cout «  ' 

cout «  “**  **\n”; 
cout  «  “**  STATISTICS  **\n"; 
cout «  “**  **\n”; 
cout «  "**  **\n”; 
cout « ' 


COlit  <<  ■■******  ******************************  ************  ********* 

cout «  endl; 

cout«“PROCESSOR  ID :  PROCESSOR  TYPE  :  PROCESSOR  UTILIZATION!  with  comm:\n” 
for  (pl->current=pl->head;pl->current;pl->cuiTent=pl->current->nextproc)( 
cout «“  “  «  pl->current->p->procid  «  " 
cout «  pl->current->p->ptype  «  “ 

cout «  double((pl->cuiTent->p->durationprocbusy))/double(clock) «  endl; 
if(pl->current->p->gettype()!=io)  { 
averageexectime  =  averageexecdme  + 
double!  (pl->current->p->onlyexectime))/double(clock); 
averagebusytime  =  averagebusytime  + 
doubIe((pl->curTent->p->durationprocbusy))/double(clock); 
cnt++; 


for  (gnodelist->current=gnodeUst->head; 

gnodelist->current;gnodelist->curTent=gnodelist->curTent->nextitem) 

communicaiion=communication+gnodelist->ciuTent->element->aissLze*comm: 

for  (gqueuelist->current=gqueuelist->head; 

gqueuelist->current;gqueuelist->current=gqueuelist->cuiTent->nextitem) 
communicadon=communicalion+ 
gqueuelist->current->element->consumptionqty*comm; 
startfile.open(‘‘starttimes”,ios: :  in); 
endfile.open(“endtimes”,ios::in); 
aputil.open(“grphutiTjos::applios::out); 
onlyexec.open(“grphexectime”,ios::applios::out); 
instlen.open(“grphresptime”>ios::applios::out); 
lenvar.open("grphinstlenvar”,ios::applios::out); 
thrput.open(“grphthroughput”,ios::applios::out); 
sum  =  0; 
squaresum=0; 
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double  stime; 

for  ( int  loop=l;loop  <=instnum;loop++)( 
startfile  » start; 
startftle  » start; 
endfile  »  end; 
endflle  »  end; 
if  (loop  =  1) 
stime  =  end; 
difference  =  end  -  start: 
sum  =  sum  +  difference: 

squaresum  =  squaresum  +  (difference  *  difference); 

I; 

stime=end  -  stime; 
average  =  sum  /  double(instnum); 
cout «  "DATA  RATE  «  drate  «  endl; 
cout «  “AVERAGE  RESPONCE  TIME  «  average  «  endl; 
variance  =  (squaresum-(double(instnum)*sqr(average)))/double(instnum); 
startfile.close(): 
endfile.closeO; 

cout «  “AVERAGE  THROUGHPUT  « ((instnum-l)*1000000)/stime«endl; 
cout «  “SIMULATION  TIME  :”«  clock  «  endl; 

cout «  “INSTANCE  LENGTH  STANDARD  VARIATION «  sqrt(  variance) «  endl: 
cout «  “COMMUNICATION  OF  ONE  INSTANCE  :'«communication; 
cout «  endl; 

cout «  “COMPUTATION  OF  ONE  INSTANCE 
cout « tottime«  endl; 

cout «  “COMMUNICATION  /  COMPUTATION  RATIO 
cout «  communication  /  tottime; 
cout «  endl; 
cout «  endl; 

aputil «  drate  «  “  “  «  averagebusytime/double  (cnt)  «endl; 
onlyexec«  drate  «  “  “  «  averageexectime/double  (cnt) «  endl; 
instlen  «  drate  «  “  “  «  average  «  endl; 
thrput «  drate  «  “  “  « ((instnum-l)*1000000)/stime«endl: 
lenvar  «  drate  «  “  “  « sqrt(variance)/average  «  endl; 
aputil  .close(); 
onlyexec.cIoseO; 
instlen.close(); 
lenvar.closeO; 
thrput.close(); 
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main()l 

simulator  *s: 
s  =  new  simulator 

s->simulate(): 
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//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 
//  Description  :  Event  Class  header  file. 
//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 

#include  <iostream.h> 

#include  <fstream.h> 

#include  "global.h” 

#ifndef  E  VENT_H 
#define  EVENT_H 

class  event! 
public: 
event(); 

event_type  geteventname(); 
int  getpriorityO; 

void  printevent(fstream&); 


event_type  eventname: 
double  starttime, 
priority; 

int  nodenum. 
queuenum, 
assocproc, 
assocmem; 

I: 


#endif 


//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 
//  Description  :  Event  Class  source  file. 
//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 

#include  "event.h" 

//Event  class  constructor 
event::event()( 

eventname  =  none: 
starttime  =  0.0: 
priority  =0.0; 
nodenum  =  0; 
queuenum  =  0; 
assocproc  =  0; 
assoc  mem  =0: 


//Returns  the  events  priority 
int  event::getpriority(){ 
return  priority; 


//Returns  event  name 
event_type  evem::geteventname()( 
return  eventname; 


//Prints  the  event  to  the  log  file 

void  event::pnntevent(fstream  &  logfi!e)f 

logfile  «  eventname  «“  “  «  starttime  «  “  "  «  priority  «  "  "  ; 
logfile  «  nodenum  «  “  “  «  queuenum  «  "  “  «  assocproc  «  “  \n ' : 

logfile  «  assocmem  «endl; 
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//  Author  :  Cem  Akin 

//  Advisor  :  Amr  Zaky 

//  Description  :  GNODE  Class  (Graph  node). 

//  Date  :  12  November  1992 

//  Last  Revised  :  03  January  1993 

#include  <fstream.h> 

#include  “ilist.h” 

#include  ‘qlist.h’’ 

#ifndef  GNODE_H 
#defme  GNODE_H 

class  gnode  { 
public: 

gnode(); 

~gnode(); 

int  loadnode(fstream&.int.int); 
int  getnodeidO; 
int  getnodepriorityO; 
int  getdataperiodO; 
int  getsetuptimeO; 
int  getbreakdowntimeO: 
int  getexecrimeO; 
int  getdatarateO; 
void  incnumreadqsO; 
void  incnumwriteqs(); 
void  setnumreadqs()( 
numreadqs=0: 

»: 

void  setnumwriteqsOI 
numwriteqs=0: 

I: 

void  incnodeinstance()( 
nodeinstance-H-; 

I: 

int  getnodeinstance()( 
return  nodeinstance; 

}; 

boolean  lastnoderead(); 
boolean  lastqueuewrite(); 
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proctvpe  getproctype(); 

proctype  getaltproctypeO; 

intlist*  getinputqueuelist(): 

intlist*  getoutputqueuelistf); 

friend  ostream&  operator«(ostream&.gnode&); 

boolean  operator<(gnode&); 

nodetype  getnodetype(): 

//private: 

int  nodeid, 
priority, 
memid, 
aissize, 
lastinstance, 
dataperiod, 
numreadqs, 
numwriteqs, 
total  inqs, 
totaloutqs, 
exectime, 
setuptime, 
breakdown, 
nodeinstance, 
waitingnumber, 
readycount, 
exectimes, 
order; 

double  proctime: 
boolean  processing, 
waitingforoutput, 
executed; 
nodetype  ntype; 
proctype  protype, 
altproctype; 
intlist  *inqlist, 

*outqlist; 


#endif 
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//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 
//  Description  :  GNODE  Class  source  code. 
//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 

#inc!ude  <stdiib.h> 

#include  "‘gnode.h" 

#include  <iostream.h> 

//Gnode  class  contructor 

gnode::gnode01 
proctime  =  0.0; 
nodeid  =  0; 
priori  ty=0; 
memid  =  0: 
aissize=0; 
lastinstance=0; 
nodeinstance=-l; 
exectime=0; 
setuptime=0; 
breakdown^); 
dataperiod=0; 
ntype  =  instruction; 
protype  =  inst; 
altproctype  =  inst; 
readycount  =  0; 
numreadqs  =  0; 
numwriteqs  =  0; 
totalinqs  =  0; 
totaloutqs  =  0: 
waitingnumber  =  0; 
inqlist  =NULL; 
outqlist  =  NULL; 
processing  =  false; 
waitingforoutput  =  false; 
executed  =  false; 

I: 

//Gnode  class  destructor 
gnode::~gnode()  { 


delete  inqlist; 
delete  outqlist; 
inqlist  =  NULL; 
outqlist  =  NULL; 


//Loads  a  single  graph  node  from  given  stream 

int  gnode::loadnode(fstream&  grphfile.int  pchoice.  int  gid){ 

int  ttype. 

extime; 

srand(int(time(NULL))); 
giphfile  »  nodeid; 
grphfile  » ttype; 
ntype  =  nodetype(ttype); 
memid  =  gid; 
grphfile  »  aissize: 
grphfile  »  exectime; 
extime  =  exectime; 
if  (ttype  !=  0) 
extime  =  0; 
grphfile  »  senjptime; 
grphfile  »  breakdown; 
grphfile  » ttype; 


switch  (pchoice)  { 
case  1  :{ 

priority  =  exectime; 
break; 

}; 

case  2 ;{ 

priority  =  -  aissize; 
break; 

1: 

case  3  :( 

cout «  “ENTER  NODE  PRIORITY 
cin  »priority; 
cout «  endl; 
break; 

): 
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case  4  : ( 

priority  =  rand(); 
break; 

I: 

case  5 : 
break: 

I: 

switch  (ttypeH 
case  0: 1 

protype  =  inst: 
break: 

>: 

case  1:{ 

protype  =  io; 
break; 

}: 

case  2:  ( 

protype  =  sec; 
break; 

}: 

case  3:{ 

protype  =  three; 
break; 

}; 

case  4:  | 

protype  =  four, 
break; 

}; 

case  5:{ 

protype  =  five; 
break; 

I; 

case  6:( 

protype  =  six; 
break; 

1; 

case  7:{ 

protype  =  seven; 
break; 

I; 


case  8;  i 


protype  =  eight; 
break; 

}: 

}: 

grphfile  » ttype; 
switch  (ttype)  ( 
case  0:  ( 

aJtproctype  =  inst; 
break; 

1: 

case  1:( 

altproctype  =  io; 
break; 
t; 

case  2:  { 

altproctype  =  sec; 
break; 

); 

case  3:{ 

altproctype  =  three; 
break; 

}; 

case  4;  { 

altproctype  =  four, 
break; 

}; 

case  5:{ 

altproctype  =  five; 
break; 

1; 

case  6;  { 

altproctype  =  six; 
break; 

1; 

case  7:{ 

altproctype  =  seven; 
break; 

I: 

case  8:( 

altproctype  =  eight; 
break; 
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inqlist  =  new  intlist; 
outqlist  =  new  intlist: 


return  extime; 


//Prints  a  graph  node 

ostream&  operator«(ostream&  os.gnode&  g)  | 

os  «  “  ID  «  g.nodeid  «  “node  type:”  «  g.ntype  «  endl; 
os  «  “AIS  «  g.aissize  «  “  Exe  time  «  g.exectime  «  endi; 
os  «  “setup  time”  «  g.setuptime  «  endl: 
os  «  “Memory  :  “«g.memid«endl; 
os  «  *g.inqlist «  endl; 
os  «  *g.outqlist «  endl; 

/*  os  «  “  priority  :”  «  g.priority  «  endl; 

os  «  “proctype «  g.protype«“  altproctype  “  «g.altproctype«endl;*/ 
return  os; 

I: 


//Returns  input  queue  list 
intlist*  gnode::getinputqueuelist(){ 
return  inqlist; 

): 


//Returns  output  queue  list 
intlist*  gnode::getoutputqueuelist()| 
return  outqlist; 

I; 


//Returns  node  id 
int  gnode::getnodeid()( 
return  nodeid; 


//Returns  node  priority 
int  gnode::getnodepriority()| 
return  priority; 

1: 

//Returns  dataperiod 
int  gnode::getdataperiod(M 
return  dataperiod; 

I: 

//Returns  setuptime 
int  gnode::getsetuptime()( 
return  setuptime; 

}; 

//Returns  processor  type  that  node  can  run  on 
proctype  gnode::getproctypeO( 
return  protype; 

); 

//Returns  alternative  processor  type  that  node  can  run  on 
proctype  gnode;;getaltproctype()  { 
return  altproctype; 

1: 

//Returns  dataperiod 
int  gnode::getdatarate()( 
return  dataperiod; 

1; 

//Returns  Node  execuiton  time 
int  gnode::getexectime(){ 
return  exec  time; 

}: 

//Increments  number  of  queues  that  are  already  read 
void  gnode::incnumreadqs()( 
numreadqs++; 


//Increments  number  of  queues  that  are  already  written 
void  gnode::incnumwriteqs()( 
numwriteqs++; 


//Returns  true  if  last  input  queue  of  the  node  has  been  read 
boolean  gnode::lastnoderead(){ 
if  (numreadqs=totalinqs) 
return  true; 
else 

return  false: 

k 


//Returns  true  if  the  last  output  queue  of  the  node  has  been  written 
boolean  gnode::lastqueuewrite(){ 
if  (num  writeqs=totaloutqs) 
return  true; 

else 

return  false; 

1; 


//Returns  the  fixed  breakdown  time 
int  gnode::getbreakdowntime(){ 
return  breakdown; 

I; 


//Returns  the  node  type 
nodetype  gnode::getnodetype()( 
return  ntype; 

1; 


boolean  gnode::operator<(gnode&  n)( 
if  (n.order>  order) 
return  true; 
else 

return  false; 

1: 
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#include  "global.h” 

#include  <fstream.h> 
class  nlist; 

#ifndef  QUEUE  ITEM_H 
#define  QUEUEITEM_H 
class  Queueltem{ 
public  : 

QueueltemO; 

void  loadqueue(fstream&,int,  nlist*): 
int  getgmid(): 
int  getthreshold(); 
int  getsinknode(): 
int  getsourcenode(){ 
return  nodein; 

I: 

int  getproductionqtyO; 
boolean  isoverthresholdO: 
boolean  isthereenoughspace(){ 

if  ((capacity  -  currentlength-writeamount)  >=  productionqty) 
return  true; 
else 

return  false; 


void  inccurrentlength(int); 

friend  ostream  &  operator«(ostream&,QueueItem&); 


//  private  : 
friend  class  Qlist; 
int  queueid, 
gmid, 
nodein, 
nodeout, 
thresholdqty, 
consumptionqty, 
capacity, 
currentlength, 
productionqty, 
writeamount, 
readamount; 
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boolean  overthreshold, 
overcapacity; 

queuetype  qtype; 

): 

#endif 
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//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 

//  Description  :  GQUEUE  Class  Head  r  fde(Graph  Queue). 
//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 

#include  <iostream.h> 

#include  "gqueue.h" 

#include  ‘‘nlist.h’' 

//Constructor 

QueueItem::QueueItem()  ( 

queueid  =  0; 

gmid  =  1; 

nodein  =  0; 

nodeout  =  0: 

thresholdqty  =  0; 

currentlength=0; 
capacity  =  0; 

writeamount=0: 
productionqty=0; 
readamount=0: 
overthreshold  =  false; 
overcapacity  =  false; 
qtype  =  data; 

1; 


//Loads  one  queue  from  given  stream 

void  QueueItem:;loadqueue(fstream&  grphfile.int  gid.nlist’,'  nodelist){ 
int  temp; 

grphftle  »  queueid; 
grphftle  » temp; 
if  (temp==0) 
qtype  =  syn_arc; 
else 

qtype  =  data; 

gtphftle  »  nodein; 

for  (nodelist->current  =  nodelist->head; 
nodelist->current; 

nodelist- >current=nodelist->curTent->nextitem)| 

if  (nodelist->current->element->getnodeidO=nodein)  i 
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nodelist->current->element->outqlist->addtolist(queueid): 

nodeiist->current->element->totaloutqs++: 


grphfile  »  nodeout; 

for  (nodelist->current  =  nodelist->head: 
nodelist->current: 

nodelist->current=nodelist->current->nexmem){ 
if  (nodelist->current->element->getnodeidO=nodeout){ 
nodelist->current->element->inqlist->addtoiist(queueid); 
nodelist->current->element->totalinqs++: 
l: 


grphfile  » thresholdqty; 
grphfile  »  currentlength: 

grphfile  »  productionqty; 
consumptionqty=productionqty; 
grphfile  »  capacity; 
if  (currentlength  >=  thresholdqty) 
overthreshold  =  true; 
gmid  =  gid; 


//Returns  Memory  id  that  queue  is  assigned 
int  Queueltem::getgmid()( 
return  gmid; 

}: 


//Returns  threshold  Quantity 
int  Queueltem::getthreshold(){ 
return  thresholdqty; 


//Prints  a  queue 

ostream&  operator«(ostream&  os.QueueItem&  q)  ( 

os  «  “ID  «  q.queueid  «  “NODE  IN  “  «  q. nodein  «  “NODEOUT  “  «  q.nodeout  «  endl 
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os  «  "MEMORY  «  q.gmid«endJ; 
os  « ‘THRES  : "  «  q.thresholdqty  «  endl: 


os  «  “CAPAC  :  “  «  q.capacity  «  “DATARATE  “  «  endl: 

os  «  "LENGTH  “  «  q.currentlength  «  "overthreshold  «  q.overthreshold  «endl; 

return  os: 


//Increments  the  current  length  of  the  queue  by  production  quantity 

void  Queueltem::inccurrendength(int  t){ 
currentlength=currentlength+t: 
if  (currentlength  >=  thresholdqty) 
overthreshold  =  true: 
else 

overthreshold  =  false; 


//Returns  sink  node 
int  QueueItem::getsinknode(){ 
return  nodeout; 


//Returns  true  if  queue  is  overthreshold 
boolean  QueueItem::isoverthreshold()l 

if  ((currentlength-readamount)>=thresholdqty) 
return  true; 
else 

return  false; 

1: 


//  Returns  production  quantity 
int  QueueItem::getproductionqty<){ 
return  productionqty; 
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//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 

//  Description  :  NLIST  CLASS  header  fde(Node  List). 
//  Date  :  12  November  1992 
// Last  Revised  :  03  January  1993 

#indude  '‘gnode.h” 

#include  "qlist.h" 

# include  <fstream.h> 

#ifndef  NL1ST_H 
#define  NLIST_H 

class  nlistj 
public: 
nlist(); 

~nlist(); 

gnode  *getnextnode<){ 

return  current->nextitem->element; 

I: 

int  sort_topologically(Qljst*); 
gnode*  getnode(int); 
int  loadnodes(fstream&,fstream&,int,int); 
friend  ostream&  operator«(ostream&,nlist&); 


struct  nitem{ 

gnode  ‘element; 
nitem  ‘nextitem; 


nitem  ‘head, 
‘current; 


#endif 
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//  Author  :  Cem  Akin 

//  Advisor  :  Amr  Zaky 

//  Description  :  NLIST  CLASS  source  code. 

//  Date  :  12  November  1992 

//  Last  Revised  :  03  January  1993 

#include"pnqueue.h" 

#include"nlist.h” 

//Constructor 

nlist::nlist(){ 

head  =  NULL; 
currents  NULL; 

}; 


//Destructor 
nlist::~nlist()( 
delete  head; 
delete  current; 
head  =  NULL; 
current  =NULL; 


//Loads  graph  nodes 

int  nlist::loadnodes(fstream&  grphfile.fstream&  mem.int  choicejnt  nummem)( 
int  num  nodes  =  0: 
int  exectime  =  0; 
int  gid; 

nitem  *tempn; 

grphfile  »  numnodes; 
for  (int  loop  =  l;loop  <=  numnodes;loop++)l 
mem  »  gid; 
gid  =  gid  %  nummem+1; 
tempn=  new  nitem; 
tempn->element  =  new  gnode; 
if  (head  =  NULL)! 

exectime=exectime+tempn->element->loadnode(grphfile.choice.gid); 
current  =  head  =  tempn; 

I 
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else{ 

exectime=exectime+tempn->element->loadnode(grphfile.choice,gid); 
current->nextitem  =  tempn; 
current  =  current->nextitem; 


return  exectime; 

I: 

//Get  the  node  whose  id  number  is  given 
gnode  *  nlist::getnode(int  nid){ 

for  (current=head:current;current=current->nextitem) 
if  (current->element->getnodeid()=nid) 
return  current~>element; 
if  (current  =  NULL) 

cen  «nid«  “***ERROR  undefined  node  id\n"; 


}: 


//Prints  a  node 

ostream&  operator«(ostream&  osjtlist&  n)( 
if  (n.head  ==  NULL) 

os  «  -LIST  IS  EMPTYVT; 

else{ 

for(n.curTent=n.head;n.current:n.current=n.current->nextitem) 
os  «  *n.current->element «  endl: 


return  os; 

): 


int  nlist;:sort_topologically(Qlist  *qlst)l 
int  count, 
loop; 

int  g_labei  =  1; 
int  in_degrees[100]; 
int  nodes[100)[2]; 
pnqueue  *q: 
intlist  *ql; 
intnode  *tempnode; 
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Queueltem  *qu; 
gnode  *nd; 
count  =  0; 
q  =  new  pnqueue; 

for  (current=head:currenucurrent=curTent->nextitem)  ( 
nodes[countl[l]  =  current->element->nodeid; 
cout «  nodes[count][l]  «  ”  “«  current->element->totalinqs«en(Il; 
in_degrees[count]  =  current->element->totalinqs: 
count++; 


count--; 

for  (loop  =0;loop<=  count;loop-H-) 
if  (in_degrees[loop]  ==  0)( 

tempnode  =  new  intnode; 
tempnode->nid=nodes[loop][  1  ]; 
tempnode->priority=0; 
q->enqueue(tempnode); 

I; 


while  (tempnode=q->dequeue())( 
for  ( loop=0;  loop<=count;loop+-t-) 
if  (tempnode->nid=nodes[loop][l]) 
nodes(loop][2]=g_label; 
nd  -  getnode(tempnode->nid); 
nd->order  =  g_label; 
ql  =  nd->getoutputqueuelist(); 

for  (ql->current=ql->head;ql->current:ql->current=ql->curTent->nexmode)  { 
qu  =  qkt->getqueue(ql->current->number): 
for  (loop=0;loop<=count;loop++) 
if  (qu->nodeout=nodes[loop][l]){ 
in_degrees(loop]=in_degrees[loop]- 1 ; 
if  (in_degrees[loop]=0){ 
tempnode  =  new  intnode; 
tempnode->nid=nodes[loop][  I]; 
tempnode->priority=0; 
q->enqueue(tempnode); 
i: 

I: 

>: 
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g_iabel  =  g_label+l: 


for  (curTent=head;current;current=current->nextitern) 
cout «  current- >element->nodeid«“-”«current->element->order«endl; 


return  g_label  -  1; 
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//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 

//  Description  :  QLIST  CLASS  header  file(Graph  Queue  List). 
//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 

#include  "gqueue.h" 

#ifndef  QLIST_H 
#define  QLIST_H 

class  Qlist{ 
public: 

Qlist();  //constructor 

~Qlist();  //destructor 

Queueltem  ‘getnextqueueQI 
return  current->nextitem->element: 

}; 

Queueltem  *  getqueue(int); 

void  loadqueues(fstream&,fstream&,nlist*,int); 

friend  ostream&  operator«(ostream&,Qlist&); 

//  private : 

struct  qitemf 

Queueltem  ‘element; 
qitem  ‘nextitem; 

I: 


qitem  ‘head, 
‘current; 


#endif 
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//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 
//  Description  :  QLIST  CLASS  source  file. 
//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 

#include  “qlist.h" 

//constructor 
Qiist::Qlist(){ 
head  =  NULL; 
current  =  NULL; 

1 


//Destructor 
Qlist::~Qlist(){ 
delete  head; 
delete  current; 
head  =  NULL; 
current  =  NULL; 


//Loads  graph  queues  from  file  namely  simdata 

void  Qlist:loadqueues(fstream  &grphfile,fstream  &memjilLst*  nolist.int  nummem)( 
int  numqueues  =  0; 
qitem  *  tempq; 
int  gid; 

grphfile  »  numqueues, 
for  (int  loop  =  l;loop  <=  numqueues;  loop++)  ( 
mem»gid; 

gid  =  gid%nummem+l; 
tempq  =  new  qitem; 
tempq->element  =  new  Queueltem; 
if  (head  =  NULL)! 

tempq->element->loadqueue(grphfile,gidjiolist); 
current  =  head  =  tempq; 

I 

else{ 

tempq->element->loadqueue(gTphfile.gid,nolist); 
current->nextitem  =  tempq; 

current  =  current->nextitem; 
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I: 


//Returns  the  graph  queue  whose  id  is  given 
Queueltem*  Qlist::getqueue(int  quid){ 

for  (current=head:current;current=current->nextitem) 
if  (current->element->queueid  ==  quid) 
return  cun-ent->element; 
if  (current  ==  NULL) 

cerr  «  "****ERROR  NO  SUCH  QUEUENn"; 


//Prints  the  graph  queue  list 
ostream&  operator«(ostreain&  os,Qlist&  q)( 
if  (q.head  =  NULL) 
os  «  -QUEUE  IS  EMPTY"  «  end]; 
else) 

for(q.current=q.head;q.current;q.current=q.current->nextitem) 
os  «  *q.cuirent->element «  endl; 

} 

return  os; 

I 
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//  Author  :  Cem  Akin 

//  Advisor  :  Aw  Zaky 

//  Description  '  IEMORY  Class  Header  file. 

//  Date  .  12  November  1992 

//  Last  Revised  :  03  January  1993 

#include  "global. h" 


class  rremorvj 
public: 

memoryO: 

void  setbusytill(double); 
void  setbusy(boolean): 
boolean  isbusyO; 
int  getmemidl): 
int  getbusytimeO; 
void  setobjectid(int); 

pnvate: 

double  busytill; 
int  memid. 
capacity, 
granularity, 
senjptime, 
bandwith; 

statusjype  status; 
boolean  busy; 
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//  Author  :  Cem  Akin 

//  Advisor  :  Amr  Zaky 

//  Description  :  LGDF  machine  simulator  class. 

//  Date  :  12  November  1992 

//  Last  Revised  :  03  January  1993 

#inc  1  ude<  iostream .  h> 

#include"memorv.h" 

//Constructor 
memory::memory()l 
memid  =  0: 
busytill  =  0: 
capacity  =  0; 
granularity  =  0: 
setuptime  =  0; 
bandwith  =  0; 
status  =  active: 
busy  =  false; 

I: 

//Updates  the  busy  time  if  it  is  in  the  future 
void  memory::setbusytill(double  t)| 
busytdl  =  t: 

): 


//Set  the  given  boolean  value 
void  memory: :setbusy( boolean  value)f 
busy  =  value: 


//Returns  true  if  the  memory  is  busy 
boolean  memory::isbusy0( 
return  busy; 

); 


//Returns  memory  id 
int  memory::getmemid()| 
return  rnemict 

1: 
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//Returns  the  time  that  memory  will  stay  busy 
int  memory::getbusytime()( 
return  busytill; 

I: 


//Sets  the  given  number  as  object  id 
void  memory::setobjecdd(int  t)( 
memid  =  t; 

}: 
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//  Author  :  Cem  Akin 

//  Advisor  :  Amr  Zaky 

//  Description  :  MLIST  CLASS(Memory  list). 

//  Date  :  12  November  1992 

//  Last  Revised  :  03  January  1993 

#include"memory.h" 

#include<fstreain  ,h> 

class  mlist  { 
public: 

mlist(); 

-mlistO; 

void  addtolist(memory  *): 
memory*  getmem(int); 
int  loadmemory(fstream&); 

private: 

struct  memnodel 
memory*  m: 
memnode*  next  mem; 

I: 

memnode*  head. 

*  current; 
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//  Author  :  Cem  Akin 

//  Advisor  :  Amr  Zaky 

//  Description  :  MLIST  CLASS  source  code. 

//  Date  :  12  November  1992 

//  Last  Revised  :  03  January  1993 

#include  <iostream.h> 

#include  “mlist.h” 
extern  "C”{ 
exit(int): 

} 


//Constructor 

mlist::mlistOl 
head  =  NULL; 
current  =  NULL; 


//Destructor 
mlist::~mlist(){ 
delete  head; 
delete  current; 
head  =  NULL; 
current  =  NULL; 


//Adds  given  memory  to  the  list 
void  mlist::addtolist(memory  *  mem){ 
if  (head  ==  NULL)( 
head  =  new  memnode; 
head->m  =  mem; 
current  =  head; 

I 

else  ( 

for  (curTentsheadxurrent^nextmem-.currentscurrent^nextmem); 
current->nextmem  =  new  memnode; 
current->nextmem->m  =  mem; 

I: 

I; 
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//Returns  the  memory  whose  id  is  given 
memory  *  m  list:  :getmem(  int  mid)( 

for  (current=head:curTent;current=current->nextmem) 
if  (current->m->getmemid()=mid) 
return  current->m; 

cout  «  mid  «  “  there  is  no  such  memory": 
if  (current  =  NULL){ 
exit(l): 

cerr  «  "***ERROR  undefined  memory  idSn"; 

); 

}: 

//Loads  memories 

int  mlist::loadmemory(fstream&  grphfile)( 
int  nummemory=0: 
memnode*  temp; 
grphfile  »  nummemory; 
for  (int  loop=l;loop<=nummemory:loop++){ 
temp=new  memnode; 
temp->m=new  memory; 
temp->m->setobjectid(loop); 
if  (head  =  NULL) 
current  =  head  =temp; 
else( 

current->nextmem=:temp; 

current=current->nextmem; 


cout  «nummemory; 

return  nummemory; 

L 
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//  Author  :  Cem  Akin 

//  Advisor  :  Amr  Zaky 

//  Description  :  PROCESSOR  CLASS  header  file. 

//  Date  :  12  November  1992 

//  Last  Revised  :  03  January  1993 


#include  "global.h" 

#ifndef  PROCESSORS 
#define  PROCESSOR_H 

class  processor! 
public: 

processor(); 

void  setsetupbusytill(double): 
void  setexecbusytill(double); 
void  setbreakdownbusytill(doubie); 
void  setsetupbusy(boolean); 
void  setexecbusy(boolean); 
void  setbreakdownbusy(boolean); 
void  setfree(boolean): 
void  setproctype(int); 
void  setobjectid(int); 
boolean  isfree(); 
boolean  issetupbusy(); 
boolean  isexecbusyO; 
boolean  isbreakdownbusyO; 
proctype  gettype(); 
double  getsetupbusytimeO; 
double  getbreakdownbusytimeO; 
double  getexecbusytime(): 

int  getprocid(); 


//private: 

proctype  ptype; 
double  setupbusytill. 
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execbusytill, 

breakdownbusytill, 

durationprocbusy, 

onJyexectime. 

startutil: 


int  procid, 
speed: 

status_type  status: 
boolean  readinginprocess, 
writeinprocess, 
waitingonsetup, 
waitingonexec. 
free, 

setupbusy, 

execbusy, 

breakdownbusy, 

pendingsetup, 

pendingbreakdown; 


#endif 
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//  Author  ;  Cem  Akin 

//  Advisor  :  Amr  Zaky 

//  Description  :  PROCESSOR  CLASS  source  file. 

//  Date  :  12  November  1992 

//  Last  Revised  :  03  January  1993 


#include<iostream .  h> 
#include"processor.h” 

//Constructor 
processor: :  processor! )  | 
ptype  =  inst; 
procid  =  0: 
setupbusytill  =  0; 
execbusytill  =  0: 
breakdownbusytill  =0; 
durationprocbusy=0; 
startutil  =  0; 
setupbusy  =  false; 
execbusy  =  false; 
readinginprocess=false; 
writeinprocess=false; 
setupbusy=false; 
execbusy  =  false; 
breakdownbusy  =  false; 
pendingsetup  =  false; 
pendingbreakdown  =  false; 
speed  =  0; 
status  =  active; 
free  =  true; 

waitingonsetup  =  false; 
waitingonexec  =  false; 

1; 


//Updates  the  busy  time  of  the  setup  stage 
void  processor::setsetupbusytill(double  t)( 
if  (setupbusytill  <  t) 
setupbusytill  =  t; 
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//Updates  the  busy  time  of  execution  stage 
void  processor::setexecbusytill(double  t){ 
if  (execbusytill  <  t) 
execbusytili  =  t; 

}: 


//Updates  the  breakdown  stage  busy  time 
void  processor::setbreakdownbusytill(double  t){ 
if  (breakdownbusytill  <  t) 
breakdownbusytill  =  t; 

1: 


//Sets  setup  busy  flag  to  the  given  boolean  value 
void  processor::setsetupbusy(boolean  value) ( 
setupbusy  =  value: 

I: 

//Sets  execution  stage  flag  to  the  given  boolean  value 
void  processor:  :setexecbusy(boolean  value)  { 
execbusy  =  value: 

); 


//Sets  breakdown  busy  flag  to  the  given  boolean  value 
void  processor::setbreakdownbusy(boolean  value)! 

breakdownbusy  =  value: 

I: 


//Sets  the  processor  free  flag  to  the  given  boolean  value 
void  processor::setfree(boolean  value)! 
free  =  value; 

I: 


//Returns  true  if  the  processor  is  free 
boolean  processor::isfree(){ 
return  free; 

}; 
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//Returns  true  if  setup  stage  is  busy 
boolean  processor:  :issetupbusy()( 
return  setupbusy; 

t: 


//Returns  true  if  the  execution  stage  is  busy 
boolean  processor::isexecbusy()( 
return  execbusy; 

}: 


//Returns  true  if  breakdown  stage  is  busy 
boolean  processor::isbreakdownbusy()( 
return  breakdownbusy; 

}; 


//Returns  the  processor  id 
int  processor:  :getprocid(){ 
return  procid; 

I; 


//Returns  time  that  setup  stage  will  stay  busy 
double  processor::getsetupbusytime(){ 
return  setupbusytill; 


//Returns  time  that  execution  stage  will  stay  busy 
double  processor:  -.getexecbusy time()  j 
return  execbusytill; 

I; 


//Returns  time  that  breakdown  stage  will  stay  busy 
double  processor:getbreakdownbusytime()| 
return  breakdownbusytili; 

1; 


//Returns  processor  type 
proctype  processor  :gettype()  f 
return  ptype; 

I; 


//Sets  the  object  id  to  the  given  value 
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void  processorsetobjectid(int  t){ 
procid  =  t; 

l: 


//Sets  the  processor  type  according  to  given  integer  value 
void  processor: :setproctype(int  t)( 
switch  (t){ 

case  0:  { 

ptype  =  inst; 
break: 

I; 

case  1:{ 

ptype  =  io; 
break; 

}; 

case  2:  { 

ptype  =  sec; 
break; 

}; 

case  3:{ 

ptype  =  three; 
break; 

I: 

case  4:  ( 

ptype  =  four; 
break; 


case  5:{ 

ptype  =  five; 
break; 

}; 

case  6;  | 

ptype  =  six; 
break; 

}; 

case  7:{ 

ptype  =  seven; 
break; 


case  8;  ( 

ptype  =  eight; 
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//  Author  :  Cem  Akin 
I I  Advisor  :  Amr  Zaky 

//  Description  :  PLIST  CLASS  header  file(Processor  list). 
//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 

#incl  ude"processor.  h” 


#include<fstream.h> 

#ifndef  PLIST_H 
#defme  PLIST_H 
class  plist  { 
public: 
plist(); 

-plist(); 

void  addtolist(processor*); 
processor*  getproc(int); 
int  loadprocessors(fstream&,int): 
//private: 

struct  procnodel 
processor*  p; 
procnode*  nextproc; 

): 

procnode*  head, 

*  current; 

}: 


#endif 
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//  Author  :  Cem  Akin 
//  Advisor  :  Ami  Zaky 
//  Description  :  PLIST  CLASS  source  tile. 
//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 

#include  <iostream.h> 

#include"plist.h" 

#include  <fstream.h> 

//Constructor 

plist::plist(){ 

head  =  NULL: 
current  =  NULL; 

); 


//Destructor 
plist::~plist()( 
delete  head: 
delete  current; 
head  =  NULL; 
current  =  NULL; 


//Adds  given  processor  to  processor  list 
void  plist::addtolist(processor  *  proc)( 
if  (head  ==  NULL)! 
head  =  new  procnode; 
head->p  =  proc; 
current  *  head: 

1 

else  1 

for(current=head:current->nextproc;curreM=current->nextproc); 
current->nextproc  =  new  procnode; 
current->nextproc->p  =  proc: 

): 


//Returns  the  processor  whose  id  is  given 
processor*  plist;:getproc(int  pid) { 
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for  (cuiTent=head;curTent;carrent=canent->nextproc) 
if  (current->p->getprocid()=pid) 
return  current->p; 
if  (current  =  NULL)  | 

cout  «”***ERROR  undefined  processor  id\n"; 

I 

1; 

//Loads  processor  from  given  stream 
int  pIist::ioadprocessors(fstream&  grphfile.int  numprocs)( 
int  prtype; 
int  proccnt  =  0: 

procnode*  temp; 

for  (int  loop=l;Ioop  <=  numprocs;Ioop++)  ( 
temp  =  new  procnode; 
temp->p=  new  processor, 
temp->p->setobjectid(loop); 
grphfde  »  prtype; 
if  (prtype  !=  1) 
proccnt++; 

ternp->p->setproctype(prtype); 
if  (head  ==NULL) 
current=head=temp: 

else( 

current->nextproc=temp; 

current=current->nextproc; 


return  proccnt; 

I; 
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//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 
//  Description  :  SCHEDULER  CLASS  header  file. 

//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 

#include  "pnqueue.h” 

#include  “gnode.h" 

#include  ‘'nlist.h" 

#include  "plist.h" 

#include  “pqueue.h” 
class  scheduler  ( 
public : 

scheduled); 

-scheduled); 

void  putnodeinrlfgnode*  ); 
void  putprocinfi(int); 
void  printreadylistO; 
void  printproclist(); 
boolean  isbusyO; 
void  setbusy(boolean); 
int  getbusytillO; 
int  getnodefromrK); 
void  setbusytill(int); 

void  schedule_node(nlist*,plist*.pqueue,,.double); 
boolean  emptyrK); 
boolean  member(int); 

//private; 

policy_type  policy; 

double  processorid, 
busytill. 
schedulingtime; 

boolean  busy; 

pnqueue*  readynodelist, 

*  freeproclist; 
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//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 

//  Description  :  SCHEDULER  CLASS  source  code. 
//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 

#include  <iostream.h> 

#include  “scheduler.h" 

#include  “global. h’’ 

#include  “event.h” 


//Constructor 
scheduler: :  scheduler()  { 

policy  =  synhronized; 
processorid  =  0; 
schedulingtime  =  0; 
readynodelist  =  new  pnqueue; 
freeproclist  =  new  pnqueue: 

}; 


//Destructor 

scheduler::~scheduler(){ 
delete  readynodelist: 
readynodelist=NULL; 


//Puts  the  given  graph  node  to  the  ready  node  list 
void  scheduler::putnodeinrl(gnode*  n){ 
intnode*  tempnode=new  intnode: 

//  n->priority  =  n->nodeinstance*(-l); 

tempnode->setnode(n->getnodeid()ji->getnodepriorityO); 

readynodelist->enqueue(tempnode); 

1: 


//Puts  the  given  processor  to  the  free  processor  list 
void  scheduler::putprocinfl(int  pid)| 
intnode*  tempnode=new  intnode; 
tempnode->semode(pid.O): 
freeproclist->enqueue(tempnode); 
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//Sets  scheduler  busy  flay  to  the  given  boolean  value 
void  scheduler::setbusy(boolean  value)! 
busy  =  value; 

): 


//Returns  true  if  the  scheduler  is  busy 
boolean  scheduler::isbusy()( 
return  busy; 

I; 


//Returns  the  time  that  scheduler  will  be  busy 
int  scheduler::getbusytiU(){ 
return  busytill; 

}; 


//Returns  a  node  from  ready  node  list  if  queue  is  empty  return  - 1 
int  scheduler;;getnodeffomrl()( 
intnode*  temp; 

temp=readynodelist->dequeue(); 
if  (temp  !=NULL) 
return  temp->geteid(); 
else 

return  -1; 

I: 


//Updates  the  scheduler  busy  time  to  the  given  value 
void  scheduler::setbusytill(int  t)( 
busytill=t: 

I: 


//Returns  true  if  the  ready  node  list  is  empty 
boolean  scheduler::emptyrl(){ 

tf  (readynodelist->head  ==  NULL) 
return  true; 
else 

return  false; 

I; 


//Schedules  a  node  from  ready  list  by  matching  it  to  a  free  processor  and 
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//  produces  start  setup  event.First  it  tries  to  Find  a  node  which  is  not 
//currently  executing 

void  scheduler::schedule_node(nlist*  nl.plist  *pl,pqueue*  eventqueue.double  clck)  { 

event*  tempevent=  new  event; 
gnode*  n: 
processor  *p; 
pnqueue::item*  previous, 

*  previous2; 


if  (readynodelist->head  !=  NULL){ 
previous  =  NULL; 

for  (readynodelist->current=readynodelist->head; 
readynodeiist->current; 

readynodelist->current=  readynodelist->current->nextitem)  { 
n  =  nl->getnode(readynodelist->current->nodeitem->geteid()); 
if  (!(n->processing)){ 
previous2  =  NULL; 

for  (freeproclist->current=freeproctist->head; 
£reeproclist->current; 

freeproclist->current=freeproclist->current->nextitem)| 

p  =  pl->getproc(freeproclist->current->nodeitem->geteidO): 
if((n->getproctype()=p->gettype())  II 
(n->getaltproctype()  =  p->gettype()  ))| 
busy  =  true; 

busytiil  =  clck+schedulingtime; 
tempevent->eventname=free_scheduler; 
tempevent->starttjme=busytill; 
tempevent->priority  =busytiU; 
eventqueue->enqueue(tempevent); 

if  (previous2  ==  NULL) 

freeproclist->head=freeproclist->current->nextitem; 

else 

previous2->nextitem=freeproclist->current->nextitem; 
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if  (previous  ==  NULL) 

readynodelist->head=readynodelist->current->nextitem; 

else 

previous->nextitem=readynodelist->current->nexutem; 

p->setfree(false); 

n->processing=  true; 

n->exectimes++; 

n  ->nodeinstance++; 

tempevent=  new  event; 

tempevent->eventname=start_setup; 

tempevent->starttime=clck; 

tempevent->priority  =clck; 

tempevent->nodenum  =n->getnodeid(); 

tempevent->assocproc=p->getprocid(); 

eventqueue->enqueue(tempevent); 

/*  if  (p->gettype()!=io){ 

cout «  "PROCESSOR  ID  :"«tempevent->assocproc; 
cout «  “  NODE  ID  :”«tempevent->nodenum«endl; 
};*/ 

return ; 


previous2  =  freeproclist->current; 
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for  (pl->current=pl->head;pl->current;pl->current=pl->current->nextproc)( 

if((n->getproctype()=pl->current->p->gettype())  II 
(n->getaltproctypeO  =  pl->current->p->gettype() )){ 
if  (pl->current->p->isfTee()){ 
busy  =  true; 

busytill  =  clck+schedulingtime; 
tempevent->eventname=free_scheduler; 
tempevent->starttime=busyti]l; 
tempevent->priority  =busytiU; 
eventqueue->enqueue(tempevent); 

if  (previous  ==  NULL) 

readynodelist->head=readynodelist->current->nextitem; 
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else 

previous->nextitem=readynodelist->curTent->nextitem; 


pl->current->p->setfinee(  false); 

n->processing=  true; 

n->exectimes-M-; 

n->nodeinstance++; 

tempevent=  new  event; 

tempevent->eventname=start_setup; 

tempe  vent->starttime=c  lck; 

tempevent->priority  =clck; 

tempevent->nodenum  =n->getnodeid(); 

tempevent->assocproc=pl->current->p->getprocid(); 

eventqueue->enqueue(tempevent); 

return ; 


*/ 


): 

previous  =  readynodelist->current; 

1; 

t; 


//Returns  true  if  the  given  node  is  already  in  the  ready  node  list 
boolean  scheduler::member(int  num)( 
if  (readynodelist->head  !=  NULL) 
for  (readynodelist->cuiTent=ready  node  list->  head: 
readynodelist->cunent; 

readynodelist->current=  readynodelist->current->nextitem) 
if  (readynodelist->currem->nodeitem->geteid()==num) 
return  true; 

return  false; 

1: 
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//Prints  the  ready  node  list 
void  scheduler::printreadylist()| 
cout « •'««« 

for  (readynodelist->current=readynodelist->head: 
readynodelist->current; 

readynodelist->current=  readynodelist->current->nextitem) 
cout  « readynodelist->current->nodeitem->nid  «  " 

cout «  "»»»\n": 

1: 

//Prints  the  free  processor  list 
void  scheduler::printproclist()i 
cout « '‘««« 

for  (freeproclist->current=freeproclist->head; 
freeproclist->current; 

freeproclist->current=  freeproclist->current->nextitem) 
cout «  freeproclist->current->nodeitem->nid  «  “ 

cout «  “»»»\n”; 

>; 
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//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 

//  Description  :  PNQUElfE  CLASS  header  tile  (Priority  Queue  For  integer  node. 
//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 

#include  "node.h" 

#ifndef  PNQUEUE_H 
#define  PNQUEUE.H 

class  pnqueuef 

public: 

pnqueueO: 

~pnqueue(); 

void  enqueue(intnode*); 
intnode*  dequeue(); 

//private: 

struct  itemf 
intnode  *nodeitem; 
item  *nextitem; 

); 


item  *head, 
*current; 


#endif 
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//  Author  :  Cem  Akin 

//  Advisor  :  Amr  Zaky 

//  Description  :  PNQUEUE  CLASS  source  file. 

//  Date  :  12  November  1992 

//  Last  Revised  :  03  January  1993 

#include  "pnqueue. h” 

#include  <iostream.h> 

//Constructor 
pnqueue::pnqueue()< 
head  =  NULL; 
current  =  NULL; 

I: 


//Destructor 
pnqueue :;  -pnqueueOI 
delete  head; 
delete  current; 
head  =  NULL; 
current  =  NULL; 


//Enqueues  the  given  integer  node  to  queue 
void  pnqueue  ::  enqueue/ intnode  *e)  j 
item  *tempnode=new  item; 
item  *previous=head; 
tempnode->nodeitem=e; 


if  (head  =  NULL) 

current=head=tempnode; 

else{ 

current=  head; 

while  ((current->nextitem!=NULL)  && 
(current->nodeitem->getpriority()<=e->getpriority())){ 
previous  =  current; 
current  =  current->nextitem; 


if  ((current~>nodeitem->getpriority()  >  e->getpriorityO)&& 
(current  !=  NULL)&&(current!=head)M 
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previous->nextitem=tempnode; 

tempnode->nextitem=current; 

} 

else) 

if((current->nextitem=NULL)  && 

(current->nodeitem->getpriorityO  <=  e->getpriority())){ 
current->nexdtem  =  tempnode; 

I 

elsej 

current  =  head; 
head  =  tempnode; 
head->nextitem  =  current: 

}; 

}; 

}; 

I; 

//Returns  integer  node  from  queue 
intnode  *  pnqueue  ::  dequeue(){ 
item*  temp; 
temp=  head; 
if  (head  !=  NULL){ 
head=head->nextitem; 
current  =  head; 
return  temp->nodeitem; 

I; 

I: 
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//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 

//  Description  :  PQUEUE  CLASS  header  file(Priority  queue  for  events). 
//  Date  :  12  November  1992 
// Last  Revised  :  03  January  1993 


#include  “event.h” 

#ifndef  PQUEUE_H 
#define  PQUEUE_H 
class  pqueuei 

public: 

pqueue(); 

~pqueue(); 

void  enqueue(event‘); 
void  printeqO: 
event*  dequeue)); 
boolean  emptyO: 

private: 

struct  it{ 

event  *eventitem; 
it  *nextitem; 

>; 


it  ‘head, 
‘current; 


I: 

#endif 
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//  Author  :  Cem  Akin 

//  Advisor  :  Amr  Zaky 

//  Description  :  PQUEUE  CLASS  source  file. 

//  Date  :  12  November  1992 

//  Last  Revised  :  03  January  1993 


#include  ''pqueue. h” 
^include  <iostream.h> 

//constructor 
pqueue::pqueueO! 
head  =  NULL; 
current  =  NULL; 
l; 


//Destructor 
pqueue ::  ~pqueue()( 
delete  head; 
delete  current; 
head  =  NULL; 
current  =  NULL; 


//Enqueues  the  given  event  according  its  priority 
void  pqueue  ::  enqueue(event  *e)  { 
it  *tempnode=new  it; 
it  *previous=head; 
tempnode->eventitetn=e; 


if  (head  =  NULL) 

current=head=tempnode; 

else{ 

currents  head; 

while  ((current->nextitem!=NULL)  && 
(current->eventitem->getpriorityO<=e->getpriority()))( 
previous  =  current; 
current  =  current->nextitem; 


if  ((current->eventitem->getpriority()  >  e->getpnority())&& 
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(current  !=  NULL)&&(current!=head)){ 
previous->nextitem=tempnode; 
tempnode->nexdtem=current: 

I 

else{ 

if((cun,ent->ne;uitem=NULL)  && 

(current->eventitem->getpriority()  <=  e->getpriority()))( 
current->nextitem  =  tempnode: 

I 

else{ 

current  =  head: 
head  =  tempnode: 
head->nextitem  =  current: 


//Dequeues  the  event  from  the  top  of  the  queue 
event  *  pqueue  ::  dequeue(){ 
it*  temp; 
temp=  head; 
head=head->nextitem; 
current  =  head: 
return  temp->eventitem: 


//Returns  true  if  the  Queue  is  empty 
boolean  pqueue::  empty(){ 
if  (head==NULL) 
return  true; 
else 

return  false: 

I: 


//Prints  the  queue 
void  pqueue:  :printeq()( 
cout «  “< 

for  (current  =  head:curTent:current  =  current->nextitem) 
cout «  current->eventitem->geteventname() «  ‘  *: 
cout «  “>  Vi"; 
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//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 
//  Description  :  NODE  CLASS  header  file. 

//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 

#include  <iostream.h> 

#ifndef  NODE_H 
#define  NODE_H 
class  intnodel 
public: 

intnode(); 
int  getpriorityO; 
int  geteidO; 
void  setnode(int  .int); 

friend  ostream&  operator«(ostream&.intnode&); 
//  private: 
int  nid. 
priority; 

1: 

#endif; 
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//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 
//  Description  :  NODE  CLASS  source  file. 
//  Date  :  12  November  1992 
// Last  Revised  :  03  January  1993 

#include  "node.h" 

//Constructor 
intnode::intnode()( 
nid  =0: 
priority  =  0: 

I: 


//Returns  node  priority 
int  intnode::getpriority(){ 
return  priority; 


//Returns  node  id 
int  intnode::geteid()| 
return  nid; 

I: 


//Set  node  with  id  and  prty 
void  intnode::setnode(int  id,int  prty)  j 
nid  =  id; 
priority  =  prty; 


//Prints  a  node 

ostream&  operator«(ostream&  os,  intnode&  e)| 
os  «  "Event  ID;  “«e.nid; 
os  «  endi; 

os  «  "priority  «  e.priority; 
os  «  endl; 
return  os; 

I: 
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//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 
//  Description  :  Integer  List  Class  header  file. 

//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 

#include  “global.h” 

#ifndef  IL1ST_H 
#define  ILIST_H 

class  intlist{ 
public: 

intlist(); 

-intlistO; 

void  addtolist(int); 
boolean  finditem(int ); 

friend  ostream&  operator«(ostream&  jntlist&); 

//private: 

struct  intnodei 
int  number; 
intnode  *nextnode; 

); 

intnode  *head. 

‘current; 


#endif 
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//  Author  :  Cem  Akin 

//  Advisor  :  Amr  Zaky 

//  Description  :  Integer  List  Class  source  file. 

//  Date  :  12  November  1992 

//  Last  Revised  :  03  January  1993 

#include<iostream.h> 

#include  "ilist.h” 

//Contructor 
intlist::intlist(){ 
head  =  NULL: 
current  =  NULL; 

»: 


//Destructor 
intlist::~intlist(){ 
delete  head; 
delete  current; 
head  -  NULL; 
current  =  NULL; 


//If  given  integer  is  in  list  returns  true 
boolean  intlist;;finditem(int  num){ 
boolean  result=false; 
if  (head  =NUI  L) 
return  result; 
else 

for(current=head;current->nextnode;current  =  current->nextnode) 
if(current->number  ==  num) 
result=true; 
return  result; 

»: 


//Adds  the  given  integer  to  the  list 
void  intlist:  :addtolist( int  num){ 
if  (head==NULL){ 
head  =  new  intnode: 
head->number  =  num; 
current=head; 
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I 

else  j 

for(current=head;cuirent->nextnode;curTent=current->nextnode); 
current->nextnode=new  intnode: 
current->nextnode->number=num; 
current  =  current->nextnode; 


//Prints  the  integer  list 

ostream&  operator«(ostream&  os.intlist&  ilist)  ( 
os  «  '<■  «  “ 

for  (ilist.current  =  ilist.head:ilist.current; 

ilist.current  =  ilist.current->nextnode) 
os  « ilist.current->number  «  “ 
os  «  '>’  «  endl: 
return  os; 

I: 
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APPENDIX  C:  Graph  Restructure  Source  Code 


//  Author  :  Cem  Akin 

//  Advisor  :  Amr  Zaky 

//  Description  :  CNODE  CLASS  header  file. 

//  Date  :  10  March  1993 

//  Last  Revised  :  13  March  1993 


class  cnode{//This  class  is  used  to  represent  a  space  that  is 
//assigned  to  a  graph  node. 

public : 

cnode(){ 
id  =0: 
start  =  0; 
finish  =  0; 

I; 

cnode(int  a){ 
id  =0: 
start  =  0; 
finish  =  a: 

}: 


int  id: 
int  start, 
finish; 

I; 
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//  Author  :  Cem  Akin 

//  Advisor  :  Amr  Zaky 

//  Description  :  CYLINDER  CLASS  header  file. 

//  Date  :  10  March  1993 

// Last  Revised  :  13  March  1993 

#include  "nodelist.h” 

#include  "nlist.h" 

#include  “qlist.h" 


class  cylinder  (//This  class  is  used  to  represent  a  cylinder 
public: 

cylinderO: 

~cylinder(); 

void  initialize_cylinder(); 
void  cylinder_assignment(); 
void  cylinder::initialize_times(): 
int  find_lalest_end_parent(int); 
boolean  map_cylinder(); 


struct  cyl_slice( 
nodelist  *slice; 
nodelist  *el; 

1: 

cyl_slice  cylin(20J: 
int  circum; 
int  ps; 

nlist  *gnodelist, 
*sortedlist; 
Qlist  *gqueuelist; 


//  Author  :  Cem  Akin 

//  Advisor  :  Amr  Zaky 

//  Description  :  Cylinder  Class  Source  tile. 

//Date  :  10  March  1993 

// Last  Revised  :  13  March  1993 


#include  "cylinder. h" 

#include  <iostream.h> 

//Cylinder  class  constructor 
cylinder::cylinder(){ 
int  c: 

int  ps  =  0; 
int  circum  =  0; 
gnodelist  =  new  nlist; 
sortedlist=new  nlist; 
gqueuelist  =  new  Qlist; 
for  (c  =0:c<20;c++){ 
cylin[c].slice  =  NULL; 
cylin[c].el  =  NULL; 


}; 

//Cylinder  class  destructor 
cylinder;;~cylinder()  { 
intc; 

delete  gnodelist; 
delete  gqueuelist; 
for  (c=0;c<20;c++){ 
delete  cylin[c]  .slice; 
delete  cylin[c].el; 
cylin[c].slice  =  NULL; 
cylin[c].el  =  NULL; 

I; 

gnodelist  =  NULL; 
gqueuelist  =  NULL; 

1; 

//Initialize  the  cylinder  to  empty  cylinder 
void  cylinder.:initialize_cylinderO( 
int  c; 

for  (c=0;c<20;c++)( 
delete  cylin[c).  slice; 
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delete  cylin[c].el; 
cylinfcj.slice  =  new  nodelist; 
cylin(c].el  =  new  nodelist(circum); 


//Clears  the  assigned  execution  times  when  mapping  attemp  is  not 
//succcsfuli 

void  cylinder::initialize_times(){ 

for  (gnodelist->current=gnodelist->head; 
gnodelist->current; 

gnodelist->current=gnodelist->currem->nextitem){ 

gnodelist->current->element->start=0; 

gnodelist->cuiTent->element->finish=0; 


//Finds  the  latest  end  parent  for  a  given  node  and  returns  it 
int  cylinder::fmdjatest_end_parent(int  num){ 
int  max=0; 
gnode*  tempnode, 

*  tempnode2; 

tempnode=gnodelist->getnode(num); 

for(tempnode->parendist->current=tempnode->parentlist->head; 

tempnode->parentlist->current; 

tempnode->parentlist->current=tempnode->parentlist->current->nextnode)( 
tempnode2=gnodelist->getnode(tempnode->parentlist->current->number); 
if  (max<tempnode2->finish) 
max  =  int(tempnode2->finish); 

I: 

return  max; 

}; 


//Maps  the  cylinder  according  to  its  topology 
void  cylinder  :cylinder_assignment  01 
boolean  flag; 
fstream  cyl; 
double  percent; 
int  cnumnodes; 

c  =  gnodelist->sort_topo!ogically(gqueuelist,sortedlist); 
gnodelist  =  sortedlist; 
flag  =  map_cylinder(); 
while  (!flag){ 
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cout «  “I  COULD  NOT  FIND  ANY  SOLUTION  WITH  “; 
cout  «  circum  «  \n"; 

cout «  “I  NEED  LARGER  CIRCUMFERENCE.  WILL  YOU  GIVE  ME  A\n”; 
cout «  “PERCENT  THAT  I  CAN  INCREASE  THE  SIZE  OF  CYLINDER'*”; 
cout «  "PERCENT  : 
cin  »  percent; 

cout «  “VrHANK  YOU  NOW  I  AM  TRYING  TO  FIND  A  SOLUTIONS'; 

circum  =  int(circum  +  circum  *  percent); 

initialize_cylinder(); 

initiaiize_times(): 

flag=map_cylinder(); 

}; 

cyl.open(“cyl.dat”,ios::out); 
for  (c=0;c<ps;c++){ 
numnodes  =  0; 

for  (cylin[c].slice->current=cylin[c].slice->head: 
cylin[c].slice->current; 

cylin[c].slice->current=cylin[c].slice->current->nextitem){ 

numnodes++; 

cout «  cylin[c].slice->current->e!ement->id«“  "; 
cout «  cylin[c].slice->current->element->start«“  “; 
cout «  cylin[c].slice->current->element->finish<<endl; 

I; 

cout «  endl; 

cout  «"***********************************************\fi”- 
cyl «  numnodes«endl; 
for(cylin[c].slice->cuxTent=cylin[c].slice->head; 
cylin[c].slice->current; 

cylin[c].slice->curTent=cylin[c].slice->current->nextitem)( 
cyl «  cylin[c].slice->current'>element->id«“ 

I: 

cyl  «endl; 

>: 

cout «  circum«endl; 
cyl  «  circum«endl; 


//Tries  to  find  a  solution  for  mapping  and  called  by  cylider  assignment 
//function 

boolean  cylinder:  :map_cylinderO( 
nitem  *tptr. 
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cnode  *tempspace, 

*tempcnode: 
boolean  found  1  =  false; 
boolean  found2  =  false; 
boolean  found3  =  false; 

int  m  instart  1  =  circum; 
int  minstart2  =  circum; 
int  minscart3  =  circum; 

int  t, 
tl, 
t2, 
t3, 
t4, 
t5, 

t6; 

intcnt,sl,s2,s3,sid,c; 
int  count=0; 

for  tgnodelist->current  =  gnodelist'>head: 
gnodelist->current; 

gnodelist->current  =  gnodelist->current->nextitem)( 
if  (gnodelist->current->element->ntype=instruction)  ( 
tptr  =  gnodelist->current: 

t  =  findJatest_end_parent(gnodelist->current->element->nodeid); 
gnodelist->current=tptr, 
for  (cnt=0;cnt<ps;cnt++) 
for  (cylin(cnt].el->current=cylin[cntj.el->head; 
cylin[cntl.el->cuiTent; 

cylin[cnt].el->current=cylin[cnt].el->current->nextitem)i 
if  ((cylin[cnt].el->current->element->start>-'.)&& 
(cylin[cnt].el->current->element->finish  >= 
cylin[cnt].el->current->element->start  + 
gnodelist->current->element->exectime)) 
if  (cylin[cnt].el->current->element->start<minstartl)  i 
minstartl  =  cylin[cnt].el->current->element->start; 
found  1  =  true; 

sid  =  cylin[cnt].el->cuiTent->element->id; 
tl  =  cylin[cnt].el->current->element->start; 
t2  =  cylin[cnt].el->current->element->finish; 
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si  =  cnt; 

I; 

if  ((cylin[cnt].el->current->element->start<t)&& 
(cyUn[cnt].el->cunent->element->finish  >=t  + 
gnodelist->current->element->exectime)) 
if  (cylin[cnt].el->current->element->start<minstart3)( 
minstart?  =  cylin[cnt].el->cuirent->element->start; 
found3  =  true; 

sid  =  cylin[cnt].el->current->element->id: 
t5  =  cylin[cnt].el->current->element->start: 
t6  =  cylin[cnt].el->current->element->finish; 
s3  =  cnt: 

): 

if  ((cylin[cnt].ei->cument->element->stan+ 
gnodelist->current->element->exectinie)  <= 
cylin[cnt].el->current->element->finish) 
if  (cylinfcnt]  .el->current->element->start<minstart2)  { 
minstart2=cylin[cnt].el->current->element->start; 
sid  =  cylin[cnt].el->current->element->id; 
s2  =  cnt; 
found2  =  true; 

t3  =  cylin[cnt].el->current->element->start; 
t4  =  cylin[cnt].el->current->element->Fuiish; 

U 


if  (foundl){ 

tempcnode  =  new  cnode; 

tempcnode->id  =  gnodeIist->current->element->nodeid; 
tempcnode->start  =  tl; 

tetnpcnode->finish  =  tl  +  gnodelist->current->element->exectime; 
gnodelist->current->element->start=t  1 ; 
gnodelist->current->element->finish=tempcnode->finish: 
cylin[sl].slice->insert(  tempcnode); 
cy  lin[s  1  ]  .el->remove(t  l ): 
if  (tl<t){ 

tempspace  =  new  cnode; 
tempspace->id  =  ++count; 
tempspace->start  =  tl; 
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tempspace->finish=  t: 
cylin[sl].el->insert(tempspace); 

}: 

if  (tempcnode->fmish  <  t2){ 
tempspace  =  new  cnode: 
tempspace->id  =  ++count: 
tempspace->start  =  tempcnode->finish; 
tempspace->finish=t2; 
cylinfs  1  ].el->insert(  tempspace); 


f*  for  (cylin[sl].el->cuiTent=cylm[sl].el->head: 

cylin[sl].el->current: 

cylin[sl].el->current=cylin[sl].el->current->nextitem)| 
cout «  cylinfs  l].el->current->elennent->start «  “ 
cout «  cylin[sl].el->current->element->finish«endl; 

): 

*/ 

foundl  =  false; 
found2  =  false; 
found3  =  false; 
m  instart  1  =circum; 
minstart2  =  circum; 
minstart3  =  circum; 

1 

else 

if  (found3)i 

tempcnode  =  new  cnode; 

tempcnode->id  =  gnodelist->current->element->nodeid; 
tempcnode->start  =  t; 

tempcnode->fmish  =  t  +  gnodelist->current->element->exectime; 
gnodelist->current->element->start=t; 
gnodelist->cunrent->element->rmish=tempcnode->finish; 
cylin[s3]  .slice->insert(tempcnode); 
if  (tempcnode->id=26) 
cout « t «“  “«t5«‘‘  “«t6«endl; 
cylin[s3].el->remove(t5); 
if  (t5<t){ 

tempspace  =  new  cnode; 
temp$pace->id  =  ++count; 
tempspace->start  =  t5; 
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tempspace->finish=  t; 
cylin[s3].el->insert(tempspace): 

I: 

if  (tempcnode->finish  <  t6)j 
tempspace  =  new  cnode; 
tempspace->id  =  -H-count; 
tempspace->start  =  terrncnode->finish; 
tempspace->finish=t6; 
cylin[s3].el->insert(tempspace); 

I: 

found  1  =  false; 
found2  =  false; 
found3  =  false; 
minstartl  =  circum; 
minstart2  =  circum; 
minstart3  =  circum; 

I 

else 

if(found2){ 

tempcnode  =  new  cnode; 

tempcnode->id  =  gnodelist->current->element->nodeid; 
iempcnode->sian  =  t3; 

tempcnode->finish  =  t3  +  gnodelist->current->element->exectime: 
gnodeIist->current->element->start=tl ; 
gnodelist->current->element->finish=tempcnode->finish; 
if  (tempcnode->id=26) 
cout «  t «“  “«t3«“  “«t4«endl; 
cylin[s2].slice->insert(  tempcnode); 
cylin[s2].el->remove(t3); 
if  (tempcnode->finish  <  t4)( 
tempspace  =  new  cnode; 
tempspace->id  =  -H-count; 
tempspace->start  =  tern  pcnode->  finish: 
tempspace->finish=t4; 
cylin[s2].el->insert(  temps  pace); 

}: 

found  1  =  false; 
found2  =  false; 
found3  =  false; 
minstartl  =  circum; 
minstart2  =  circum; 
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minstart3  =  circum; 


1 

else 

return  false: 


>: 

1; 

cout «  "CYLINDER  FRAGMANTATION  \n”; 
for  (c=0:c<ps;c++){ 

for  (cylin[c].el->current=cylin[c].el->head: 
cyUn[c].el->current: 

cylin[c].el->current=cylin[c3.el->current->nextitem)i 
cout «  cylin[c].el->current->element->id«" 
cout «  "start....:  “«cyIin(c].eI->cuiTent->element->start: 
cout «  “finish....:  "<<cylin[c].el->curTent->element->finish; 
cout «  endl; 


cout«“* 


return  true; 


main(){ 

int  tottime: 
fstream  myftle; 

myfile.open(“simdata",ios::in); 

cylinder  *c  =  new  cylinder: 

cout «  "Enter  Cylinder  Circumference 

cin  »  c->circum; 

cout «  c->circum; 

cout  «  VEnter  Processor  Number 

cin  »  c->ps; 

c->initialize_cylinderO; 

tottime  =  c->gnodelist->loadnodes(myfile); 

c->gqueuelist->loadqueues(myfile.c->f’nodebst); 

c->cylinder_assignment(): 
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//  Author  :  Cem  Akin 

//  Advisor  :  Amr  Zaky 

!l  Description  :  ARCNODH  CLASS  header  file. 

//  Date  :  12  November  1992 

// Last  Revised  :  03  January  1993 


class  arcnode ! 
public: 

int  sourcenodeid. 
sinknodeid. 
initialjength, 
threshold, 
production, 
consumption, 
capacity; 
arcnode(){ 
sourcenodeid  =  0; 
sinknodeid  =  0; 
initial_length=0; 
threshold  =0; 
production  =0; 
consumption  =0: 
capacity  =0: 

I: 

I; 
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//  Author  :  Cem  Akin 

//  Advisor  :  Amr  Zaky 

//  Description  :  ARCNODE  CLASS  header  fde. 

//  Date  :  12  November  1992 

// Last  Revised  :  03  January  1993 


#include  "arcnode.h" 
#include  "global.h” 
#ifndef  ARCLIST_H 
#define  ARCL1ST_H 


class  arclist{ 
public: 

arclistO;  //constructor 

-arclist():  //destructor 

boolean  is_already_exist(int,  int); 
void  addtolist(arcnode‘); 

//  private : 

struct  sitem{ 

arc  node  ‘element; 
sitem  ‘nextitem; 


sitem  ‘head, 
‘current; 


#endif 
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//  Author  :  Cem  Akin 

//  Advisor  :  Amr  Zaky 

//  Description  :  ARCLIST  CLASS  source  file. 

//  Date  :  12  November  1992 

//  Last  Revised  :  03  January  1993 


#include  <iostream.h> 
#include  “arclist.h" 

//Constructor 
arclist::arclist(){ 
head  =  NULL; 
current  =  NULL; 


//Destructor 
arc  li  st: ;  — arclist( )  { 
delete  head; 
delete  current; 
head  =  NULL; 
current  =  NULL; 


boolean  arclist::is_already_exist(int  nr.int  ns){ 

for  (current=head;cunrent;current=cuiTent->nextitem) 
if  ((current->element->sourcenodeid=ns)&& 
(current->element->sinknodeid==nr)) 
return  true; 

return  false; 

I; 


void  arclist:;addtolist(arcnode*  an){ 
sitem*  tempn; 
tempn  =  new  sitem; 
tempn->element=an; 


if  (head==NULL) 
head  »  current  =  tempn; 
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else{ 

for  (cun'ent=head;cairent->nextitem;currem=current->nexDiem); 

c  arre  nt->nextitem=tempn ; 

current=tempn; 

): 
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//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 

//  Description  :  CYLINDER  CLASS  header  fde.This  used  for  index  assignment  and  dependency 
//  arc  creation 

//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 

#include  “clist.h” 

#include  "arc  list,  h” 
class  cylinder  { 
public: 
struct  slice{ 
int  slicenum; 
clist*  nodelist; 
slice  ‘nextslice; 

): 


cylinder!)  { 
head  =  NULL; 
current  =  NULL; 


-cylinder!)  { 
delete  head; 
delete  current; 
head  =  NULL; 
current  =  NULL; 

cnode*  getnode(int); 
cnode*  find_latest(double); 
cnode*  f nd_latest_start( double) ; 
void  flnd_RC_arcsfint); 
void  start_after_start(int); 
void  start_after_finish(int); 
void  assign_indices(intant); 
slice  *head, 

‘current; 
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//  Author  :  Cem  Akin 
//  Advisor  :  Amr  Zaky 

//  Description  :  CYLINDER  CLASS  source  file.This  used  for  index  assignment  and  dependency 
//  arc  creation. 

//  Date  :  12  November  1992 
//  Last  Revised  :  03  January  1993 
#include  “cylinder.h” 

#include  '‘nlist.h" 

#include  “qlist.h” 

#include  “mlist.h” 

#include  “plist.h” 

nlist  *gnodelist; 

Qlist  *gqueuelist: 

int  numprocs, 
numqueues, 
numnodes, 
nummemory, 
cyl_circum; 

cnode*  cylinder::getnode(int  nid){ 
for  (current=head;current;current=current->nextslice) 
for(current->nodelist->curTent=current->nodelist->head; 
current->nodelist~>current; 

current->nodelist->current=current->nodelist->current->nextitem) 
if  (current->nodelist->current->eIement->nodeid==nid) 
return  current->nodelist->current->element; 
return  NULL; 

I; 


void  cylinder::assign_indices(int  nid,int  index)  { 

gnode  *tql, 

*tq2; 

cnode  *tcql, 

*tcq2; 

Queueltem  *tempqueue; 
intlist  *inputqslst, 

*inqs. 
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*outqs, 

*outputqslst; 

inputqslst=new  intlist; 
outputqslst=  new  intlist; 


tql  =  gnodelist->getnode(nid); 
if  (tql->ntype==instruction)( 
tcql=  getnode(nid); 

cout «  nid  «" . ”«index«endl; 

tcql->index  =  index; 
tcql->indexed  =  true; 
inqs=tql->getinputqueuelist(); 
outqs=tql->getoutputqueuelist(); 
for  (inqs->current=inqs->head; 
inqs->cunent; 

inqs-x:urrent=inqs->current->nextnode) 

inputqslst->addtc!ist(inqs->curTent->number); 

for  (outqs->current=outqs->head; 
outqs->cunrent; 

outqs->carrent=outqs->current->nextnode) 

outputqslst->addtolist(outqs->canent->number); 

for  (inputqslst->current  =  inputqslst->head;inputqslst->cunent;inputqslst->currer.t=inputqslst- 
>current->nextnode)  { 

tempqueue=gqueuelist->getqueue(inputqsist->current->number); 

tq2=gnodelist->getnode(tempqueue->nodein); 

if  (tq2->ntype==instruction){ 
tcq2=getnode(tempqueue->nodein); 
if(!(tcq2->indexed)) 
if  (tcq2->fmishexec>tcql->startexec) 
assign_indices(tcq2->nodeid,index+l); 
else 

assign_indices(tcq2->nodeid4ndex); 

I; 

I; 
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for  (outputqslst->current  =  outputqslst->head;outputqslst->current:outputqslst->current=outputqslst- 

>curTent->nextnode){ 

tempqueue=gqueuelist->getqueue(outputqslst->current->number); 
tq2=gnodelist->getnode(tempqueue->nodeout): 
if  (tq2->ntype==instruction){ 
tcq2=getnode(tempqueue->nodeout); 
if((!(tcq2->indexed))lltcq2->index>=tcql->index) 
if  (tcq2->startexec<tcql->finishexec) 
assign_indices(tcq2->nodeid.index- 1 ): 
else 

assign_indices(tcq2->nodeid.index): 

1; 


l: 


cnode*  cylinder::find_latest(double  t){ 
double  max=0.0; 
cnode*  temp; 

for  (current=head;current;current=current->nextslice) 
for(current->nodelist->current=current->nodelist->head; 
cuncnt->n3delist->current; 

curTent->nodeIist->current=current->nodelist->current->nextitem )  { 
if  (t==0)( 

if(current->nodelist->current->element->Fmishexec  >  max){ 
max  =  current->nodelist->current->element->fiinishexec; 
temp  =  current->nodelist->current->element; 


if((cuirent->nodelist->current->elernent->finishexec  <=  t)&& 
(current->nodelist->current->element->finishexec  >  max))) 
max  =  current->nodelist->curTent->element->finishexec; 
temp  =  current->nodelist->current->element; 


if  (max=0) 

for  (current=head;current;current=curTent->  nextslice) 
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for(cutrent->nodelist->current=current->nodelist->head; 

cunent->nodelist->current; 

current->nodeIist->current=current->nodelist->curTent->nextitem) 
if(current->nodelist->current->element->finishexec  >  max){ 
max  =  current->nodelist->current->eIement->finishexec; 
temp  =  current->nodelist->cmrent->element; 

I: 


return  temp; 

I; 


cnode*  cylinder::fmd_latesf_start(douhle  t){ 
double  max=0.0; 
cnode*  temp; 

for  (current=head;currcnt;current=current->nextslice) 
for(current->nodelist->current=current->nodelist->head; 
current->nodelist->curreni; 

current->nodelist->current=current->nodelist->curTent->nextitem)( 
if  (t==0){ 

if(current->nodelist->current->element->startexec  >  max)| 
max  =  current->nodelist->cuiTent->element->startexec; 
temp  =  current->nodelist->current->element; 


else{ 

if((current->nodelist->current->eIement->startexec  <  t)&& 
(current->nodelist->current->element->startexec  >  max)){ 
max  =  current->nodelist->curTent->element->startexec; 
temp  =  current->nodelist->current->element; 

1; 

); 

I; 

return  temp; 

I: 


void  cylinder:  :start_after_finish(int  gnum){ 
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int  i, 

j: 

intlist*  nolist; 
cnode  *tempcnodel, 

*tempcnode2; 
arcnode*  temparc; 
arclist*  arclst: 
fstream  arcdata.datafile: 
arcdata.open("tokens".ios:  :out); 
arclst  =  new  arclist; 
nolist  =  new  intlist; 

for  (current=head;current;carrent=curTent->nextslice) 
for(current->nodelist->cun'ent=curTent->nodelist->head; 
current->nodelist->curTent; 

current->nodelist->current=current->nodelist->current->nextitem) 

nolist->addtolist(current->nodelist->current->element->nodeid): 

for  ( nolist->current=nolist->head;nolist->current; 
nolist->cun-ent=nolist->current->nextnode){ 
tempcnodel=getnode(nolist->current->number); 
i  =  fempcnodel-i.  index; 

tempcnode2=findJatest(tempcnodel->startexec); 
j  =  tempcnode2->index; 

iff!  (arclst->is_already_exist(tempcnode  1  ->nodeid.tempcnode2->nodeid)))  | 
if  ((tempcnodel->staitexec==0)ll 
(tempcnode2->finishexec==cyl_circum)) 

J=J-  I: 

temparc  =  new  arcnode; 
temparc->sourcenodeid=tempcnode2->nodeid; 
temparc->sinknodeid  =tempcnodel->nodeid: 
if(i>=j)( 

temparc->initiai_length=i-j; 
temparc->threshold  =  1; 
temparc->consumption=  1; 
temparc->production=  1 ; 
temparc->capacity  =  100: 

I 

else 

if(i<j)( 

temparc->initialjength=0; 
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temparc->threshold  =j-i+l 
temparc->consumption  =  1 : 
temparc->production  =1; 
temparc -capacity  =100; 


1: 

arclst->addtolist(temparc ): 

I; 


gnum++; 

datafile.open("simdata'\ios::outlios::app); 
for  (arclst->current=arclst->head; 
arclst->current; 

arclst->current=arclst->CLirTent->nextitem){ 
datafile  «  gnum«  “ 
datafile  «  0  «  “ 

datafile  «  arclst->current->eletnent->sourcenodeid  «  “ 
datafile  «  arclst->current->element->sinkncxleid  «  “ 
datafile  «  aiclst->current->eletnent->threshold  «  “ 
datafile  «  arclst->current->eletnent->initial_length  «  “ 
datafile  «  arclst->current->elenient->consumption  « ** 
datafile  «  100«  endl; 

gnum++; 

arcdata  «  arclst->current->element->sourcenodeid  «  “ 
arcdata  «  arclst->current->element->sinknodeid  «  “ 
arcdata  «  arclst->current->eleniem->initial_length  «  “ 
arcdata  «  arclst->current->element->threshold  «  “ 
arcdata  «  arclst->current->element->consumption  «  endl; 

1; 

arcdataxloseO; 

datafile.closeO; 


void  cylinder::stait_after_start(int  gnum){ 
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int  i. 

j: 

intlist*  nolist; 
cnode  *tempcnodel, 

*tempcnode2; 
arcnode*  temparc; 
arclist*  arclst; 
fstream  arcdata,datafile: 
arcdata.open(“tokens"  ,ios:  -.out); 
arclst  =  new  arclist; 
nolist  =  new  intlist; 

for  (current=head;current;curTent=current->nextslice) 
for(current->nodelist->current=current->nodelist->head; 
current->nodelist->current; 

current  >nodelist->current=curren:->nodelist->curTent->nextitem) 
nolist->addtolist(current->nodelist->current->element->nodeid); 

for  (nolist->current=nolist->head;nolist->current; 
nolist->current=nolist->current->nextnode){ 
tempcnode  1  =getnode(nolist->current->number); 
i  =  tempcnode  l->index; 

tempcnode2=find_latest_start(tempcnode  1 ->startexec); 
j  =  tempcnode2->index; 

if(!(arclst->is_already_exist(tempcnodel->nodeid,tempcnode2->nodeid))){ 
if  (tempcnodel->startexec==0) 

j=j- 1; 

temparc  =  new  arcnode; 
temparc->sourcenodeid=tempcnode  l->nodeid; 
temparc ->sinknodeid  =tempcnode2->nodeid; 
if(«>j){ 

temparc->initial_length=0; 
temparc->threshold  =  1; 
temparc->consumption  =  1; 
temparc->production  =  1; 
temparc->capacity  =  i-j; 

I 

else 

if(i<j){ 

temparc ->initial_length=j-i+ 1 : 
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temparc->threshold  =1; 
temparc->consumption  =1; 
temparc->production  =  1 : 
temparc->capacity  =1; 
t 

else) 

temparc->initial_length=  1  ; 
temparc->threshold  =1; 
temparc->consumption  =1: 
temparc->production  =  1 ; 
temparc->capacity  =1: 

}; 

arclst->addtolist(temparc); 


gnum++; 

datafile.open(“simdata”,ios:  :outlios: :  app); 
for  (arclst->current=arclst->head; 
arclst->current; 

arclst->curTent=arclst->current->nextitem){ 
datafile  «  gnum«  “ 
datafile  «  0  «  “ 

datafile  «  arclst->current->element->sourcenodeid  «  “ 
datafile  «  arclst->current->element->sinknodeid  « '* 
datafile  «  arclst->current->element- threshold  «  “ 
datafile  «  arclst->current->eletnent->initialjength  «  “ 
datafile  «  arclst->current->element->consumption  «  “ 
datafile  «  arclst->current->element->capacity«  endl: 

gnum++; 

arcdata  «  arclst->cuiTent->element->sourcenodeid  «  “ 
arcdata  «  arclst->current->element->sinknodeid  «  “ 
arcdata  «  arclst->current->element->initial_length  «  “ 
arcdata  «  arclst->current->element->threshold  «  “ 
arcdata  «  arclst->ctirrent->element- consumption  «  endl; 
I; 

arcdata.closeO: 

datafile.close(); 
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>: 


void  cylinder::find_RC_arcs(int  gnum)| 
int  choice: 

cout  «  "CHOICE  ONE  OF  THE  FOLLOWINGNnSn" 
cout «  “1.. ST  ART  AFTER  FINISH  (S  AF)\n": 
cout «  "2  .START  AFTER  START  (SAS)\n\n”: 
cout «  "Choice  : 
cin  »  choice; 
if  (choice  ==  1) 

start_after_fmish(gnum); 

else 

start_after_start(gnum); 


main()( 

int  numassignednodes. 
temp, 
dummy, 
loop, 
loop2; 
mlist  *  ml; 
ml  =  new  mlist; 
plist  *  pi; 
pi  =  new  plist; 
gnodelist  =  new  nlist; 
gqueuelist=  new  Qlist: 
gnode  *  tempnode: 
cnode  *  tempcnode; 
cylinder-slice  *  tempslice; 
clist  *  templist; 
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cylinder  *c  =  new  cylinder; 
int  gnum; 

fstream  <iatafile.config,cyldata,pic.mem; 

datafUe.open(“simdata",ios::in); 

config.open(“machine",ios::in): 

cyldata.open(‘‘cyl.map",ios::in); 

mem  .open(“memodules",ios:  :in); 

nummemorv  =  ml->loadmemory(config); 

numprocs  =  pl->loadprocessors(config); 

dummy  =  gnodelist->loadnodes(datafile.mem,5); 

gnum=gqueuelist->loadqueues(datafile,mem.gnodelist); 

cout «  *gnodelist: 

datafile.closeO; 

mem.close(); 

gnodelist->sort_topologically(gqueuelist); 

tempnode  =  new  gnode; 
for  (loop  =  0;  loop  <  numprocs:  loop++){ 
cyldata  »  numassignednodes; 
tempslice  =  new  cylinder.: slice; 
templist  =  new  clist: 

for  (loop2  =  0;loop2  <  numassignednodes:Ioop2++){ 
cyldata  » temp; 
tempnode  =  new  gnode; 
tempcnode=  new  cnode; 
tempnode  =  gnodelist->getnode(temp); 
tempcnode->nodeid  =  tempnode->nodeid; 
tempcnode->order  =  tempnode->order; 
tempcnode->exectime=  tempnode->exectime; 
templist->insert(  tempcnode) ; 


temps  lice->slicenum  =  Ioop+1; 
tempslice->nodelist  =  templist; 
i  temps  lice->nextslice=  NULL; 

if  (c->head  =  NULL) 
c->head=c->current=tempslice: 
else{ 

c->current->nextslice  =  tempslice; 
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c->current  =  tempslice; 


l: 

cyldata»cyl_circum; 
cyldatacloseO; 
double  time: 

for  (c->current=c->head;c->current;c->current=c->current->nextslice)| 
time  =0.0; 

for(c->current->nodelist->current=c->current->nodelist->head; 

c->current->nodelist->current: 

c->current->nodelist->current=c->current->nodelist->current->nextitem)( 

c->cuxTCnt->nodelist->current->element'>indexed=false; 

c->current->nodelist->current->element->index=0; 

c->current->nodelist->current->element->startexec=time; 

time=c->current->nodelist->cuiTent->element->exectime+time; 

c->current->nodelist->current->element->ftnishexec=time; 

}: 


for  (c->current=c->head;c->current;c->cun'ent=c->cuiTent->nextsIice){ 
for(c->current’>nodelist->current=c->current->nodelist->head; 
c->current->nodelist->current: 

c->current->nodelist->current=c->current->nodeljst->currem->nextitem)| 
cout«c->current->nodelist->curTent->element->nodeid«  “  **; 
cout«c-x:urrent->nodelist->current->element->index«" 
cout«  c->current->nodelist->current->element->startexec«“ 
cout«  c->current->nodelist->current->element->rmishexec«" 

I; 

cout «  endl«endl; 

); 

c->assign_indices(c->head->nodelist->head->element->nodeid.O); 

c->fmd_RC_arcs(gnum); 

cyldata.open(“cyl.map"jos::out); 

pic  .open(“pic  ture”  jos::out); 

int  level  1=0: 

int  ait=l; 

int  previous  =  0; 

for  (c->cuiTent=c->head;c->current;c->cunent=c->curTent->nextslice){ 
for(c->current->nodelist->cun'ent=c->current->nodelist->head; 
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c->current->nodelist->currem; 

c->current->noddist->current=c->current->nodelist->current->nextitem)l 

pic«previous«“ 

if  (alt) 

pic«  level  l«endl: 
else 

pic«  level  l+100«endl: 

pic«int(c->current->nodelist->current->element->fmishexec/1000)«‘'  “ 
if  (alt) 

pic«  level  l«endl; 
else 

pic«  levell+100«endl; 
if  (alt) 
alt  =  0: 
else 
alt  =  1; 

cyldata«c->current->nodelist->current->element->nodeid«  " 
cyldata«c->current->nodelist->current->element->index«“ 
cyldata«  c->current->nodelist->cuiTent->element->startexec«“ 
cyldata«  c->current->nodelist->cuiTent->element->finishexec«“ 
previous  =int(c->cuiTCnt->nodelist->current->element->finishexec/1000): 
cyldata«“ 

I; 

previous  =  0; 
if(alt) 

pic  «  0  «  “  “«levell+100«  endl; 
else 

pic  «  0  «  “  “«levell  «  endl; 
if  (alt) 
alt  =  0; 
else 
alt  =  1: 

level  1  =  level  1  +  100; 
cyldata«endl«endl; 

I: 

cyldataxloseO; 
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APPENDIX  D:  Interrelation  of  the  Files  in  PIPDAFS  and  GR 
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